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(57) A bstract: The invention relates to a method for identifying and validating plant targets for agrochemicals, comprising the steps 
£ of determining gene or protein expression profiles in function of the progression of an essential biological process in a plant, and the 
° subsequent deregulation of expression of said gene or protein in a plant cell. More particularly, the effects of downregulation of the 
O candidate target gene were directly monitored on plants locally infected with a vector mediating viral induced gene suppression in that 
> infected plant area. The invention also relates to isolated plant genes encoding proteins involved in plant growth and development. 
^ The invention also relates to planus tolerant to agrochemicals such as herbicides or pesticides. 
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IDENTIFICATION AND VALIDATION OF NOVEL TARGEtTfoT 

AGROCHEMICALS 
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e, ppm en, and to methods tor identifying and validating these genes/plins as £ 
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fait oen "I aPPr ° aCh " ndinfl ^'ochemioats woutd involve identification of 
target genes/prote.ns against which the agrochemical compounds could possibly work. For this 
process rdentfcation o, suitable target genes/proteins, the conventional methods make us 

rairco T °V h : ,ar9e ' ° rSaniSm - G6ne libra ' ieS are generally made as a 

rand m olteCon o, thousands o, gene knock-outs.ln these methods i, is investigated , the 
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detaenta, effect on the organism. The indication tha, said gene/protein is essentia, to the 
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based on the downregulahon of said gene or protein for example via anti-sense expression 
technology (WO0107601). expression 
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targat genes/proteins against which the agrochemical compounds could possibly work, (2) 
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validation of the candidate target gene as being an essential gene/protein for the organism and 
(3) use of these target genes/proteins in an in vitro screening procedure in which the chemical 
compounds are tested. 

It is the aim of the present invention to develop a process for the more efficient identification of 
5 candidate target genes/proteins for agrochemicals, combined with the more efficient validation 
of the target genes/proteins. It is a further aim of the invention to provide this process in order 
to design more efficiently the screening procedure with the agrochemical compound. 

The method of the present invention is based on the direct use of genetic information for 
10 example generated by expression profiling of the. candidate target genes/proteins, for the 
identification and the validation of the targets. 

Therefore according to a first embodiment of the present invention, there is now provided a 
method for identifying and validating plant genes/proteins as targets for agrochemicals, said 
15 method comprising the steps of: 

a. determining gene or protein expression profiles during a biological process of a plant or 
plant cell, said biological process being necessary for the viability or the growth of the 
plant or plant cell; 

b. selecting genes or proteins having altered expression during said biological process, 
20 c. cloning said selected gene or the nucleic acid encoding said protein in its full-length or 

partial form, 

d. incorporating said nucleic acid in a vector designed for downregulation of expression of 
said nucleic acid or the sequence homologous to said nucleic acid in a plant or plant 
cell. 

25 

The aim of methods of the present invention is the identification of target gene(s)/protein(s) out 
of a broad range of candidate plant genes/proteins. The identification step is achieved by the 
techniques of expression profiling described in the following embodiments. Since the method 
of the present invention can be used for identification of genes/proteins or proteins, the term 

30 "target" as used herein can mean a gene as well as a gene product, namely a protein, 
polypeptide or peptide. With the expression "target for an agrochemical" is meant a protein as 
well as a gene or nucleic acid encoding such protein, and when such target is inhibited, 
stimulated or otherwise disrupted in its normal activity by an agrochemical compound, this 
would lead to a desired effect in a target organism. The invention aims at efficiently identifying 

35 targets for agrochemicals. Said agrochemicals can be herbicides or pesticides as well as 
gTowth"stimulators"or~growth-regulators: 
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Targe rdenMcahon means select candidate ,ar g e,s from a larger nunrbJo, gTn«ns 
or prote ,ns on the oasis „, certain properties «ha, give such a molecuie a highe pra oa« y 0 
be,n a a suitable farge, than Cher molecules which do no, exhibi, said properties 

5 1 nonl? TTk 3 Pr0 ' ein ° r ' hat Whe " inhMed ' S,i ™' ated " disrupted in 

1 T * " 3 W ° Uld ki " ' he <""*> - -» a *ong negative 

I Z Z ZT saw compound wou,d ,here,ore be a candida,a *■ ~ 

bv a OT r tha ' WhSn inhib " ed ' S,imUlafed ° r ° thd ™ iSe di5 ^ ted » * norma, 
T DV 3 C °; P0Und W ° Uld ki " ,he »-* - a ^rong negative effect on its growth 
a, d compound wouid therefore be a candidate insecticide. A pian, growth regulator (PGR) 

aX' 8 : :T OT T ,ha ' inh ' bi,ed ' " " * o- 

achvrty by a compound woutd promote or alter in a desirable way the growth of plan, said 

compound would therefore be a candidate PGR. 

a N r a uXt 7 ? enomic informa,ion ' es - sene sequences ' 

and p ut «,ve ,unCona,„ y , is availabie from genomic sequencing and expression stedie! in 
evera, target organisms. „ is therefore of inferos, ,o develop a new meld to iden J d 
vacate genes/proteins as candidate ,arge,s for agrochemicals, such me,hods being bas d on 

z e o, r su : h 9enomic informa,ion - This use « 9en ° mic «* - ~ 

O his , Trt T SeleC " 0n " 3 Bnlited "« °' aPPr ° Priate ~* senes/prceins. 

effioency and success rate ot ,he screening procedure for agrochemicals. Purthermore the 
en., ,„ orma„o, e.g. ,he funCiona, date of ,he putative terge, gene/protein, is used as a 

col nd T" m °' S effiden " y * *° ~ 9 ~* * *• aurochemica, 
compound (s) under investigation. 

llZT T" * C ' 0SeS me,h ° dS ' hat a "° W f ° r ,hS - —» - <ar S e, 

proteins. „ therefore allows genes or proteins ,o be selected for the development of suitabie in 
wtro screenmg me,hods for the screening o, novel and efficient agrochemicais 

a~e d a b irSt SteP " me,h ° dS °' PreSen * ~ ° r ^ P"** 

are ,dent,„sd by us.ng transcrip, proving of the genomic content of a cell. By using this 

LTlTdTT ° MainS 9enmiC "* <S ™ " ~" as we 

fo nrr candida,e ,ar9et 9ene ° r 9ene <"***■ Th - «• * -fu, 

LrLdes 7 ^ SaleCti ° n °' P ° SSib,e a 8™*amical targe, genes/proteins, since i, 
tZ Toe 3 ' 8 S 9en ° mlC 3nd fUnC,i ° na ' data ° n ' he Candidate <"»* A good candidate 

el tia h ," 3 °' W " iCh eXPreSSl °" VarieS Si9 * artl ^ — »» — - - 
S b ' 0 '° 9 ' Ca ' ProCaSS « lh6 «* *- »- - an indication ,ha, ,he gene/protein is 
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involved in that biological process The present application describes for the first time that the 
determination of an expression profile of a gene during the progression of an essential 
biological process is used to identify possible agrochemical targets. 

5 The expression profiling in the target identification steps of the method of the present invention 
is carried out in function of the progression of a process that is essential for plant growth 
and/or plant development and/or plant viability. In one preferred embodiment of the present 
invention, the essential process that is monitored in the target identification step is the process 
of cell division. Accordingly, in a particular embodiment of the invention, the method to identify 
10 target genes/proteins for agrochemicals is based on the transcript profiling of genes/proteins 
that are specifically involved in cell division. Therefore the invention provides a method as 
mentioned above, wherein said biological process celj division, _ 

Other biological processes that may be monitored for the identification and validation of 
15 agrochemical targets are for instance processes that are essential for seed germination, leaf 
formation, etc. 

The term expression profiling means determining the time and/or place when or where a gene 
or a protein is active. Particularly for a gene, this is achieved by monitoring the level of 
20 transcripts and therefore in the case of gene expression profiling the term transcript profiling or 
mRNA profiling is used. 

Generally, the expression profiling in the methods of the present invention is carried out in 
function of the progression of a process that is essential for plant growth and/or development 
and/or plant viability. To achieve this, the process of interest is synchronized in a sufficient 

25 number of cells (for example in a cell culture) or organisms to allow collecting samples for 
expression profiling representing various stages of said process. Target identification then 
consists in selecting those genes or proteins that show significant changes in expression levels 
in function of the progression of the process of interest. It are those genes or proteins that are 
likely to be strongly involved or to be essential in said process. 

30 The term "essential" means that if the gene or the gene product cannot function as normal in 
the cell or organism, this will have significant implication in the cell growth or cell development 
or other vital functions of the cell or organism. 

According to the invention, the expression profiling can be studied at the level of m-RNA, using 
35 transcript profiling techniques, or alternatively at the level of protein, using proteomics-based 
approaches. 
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stages of the essential biological process. In a particular embodiment of the present invention 
the samples are obtained from cells that are synchronized for cell division. In a preferred 
embodiment of the invention expression profiling is done on synchronized dividing cells. 
Certain cell lines are particularly suitable for synchronization of cell division, for instance 
5 synchronization of tobacco Bright Yellow-2 cell lines as described in example 1. Therefore 
most preferably, the synchronized cells are tobacco BY2 cells. By using synchronized tobacco 
BY2 cells and performing a cDNA-AFLP-based genome-wide expression analysis, the 
inventors built a large collection of plant cell cycle-modulated genes/proteins. Approximately 
1340 periodically expressed genes/proteins were identified, including known cell cycle control 

1 0 genes as well as numerous novel genes. A number of plant-specific genes were found for the 
first time to be cell cycle modulated. Other transcript tags were derived from unknown plant 
genes showing homology to cell cycle-regulatory genes of other organisms. Many of the genes 
encode novel or uncharacterised proteins, indicating that several processes underlying cell 
division are still largely unknown. These sequences are presented herein as SEQ ID NO 1 to 

15 SEQ ID NO 785. 

While, according to the invention, the basic criterion for identifying an agrochemical target 
gene or gene product consists in the differential expression levels of the gene or the protein 
observed during the progression of an essential biological progress, secondary selection 
20 criteria can be used and combined with this primary criterion. 

■ 

One such secondary criterion may be to make a selection of genes or proteins that are found 
not to exhibit a high degree of homology with genes or proteins from other organisms (such as 
mammals) as this criterion is likely to reduce the probability that the agrochemical compounds 
25 active on the "plant-specific" target genes or gene products would also exhibit toxic effects 
against other organisms, for example mammals. 

Another secondary selection criterion could exist in focussing on a particular phase of the 
essential biological process as mentioned above. For instance, when cell division modulated 
genes/proteins are under investigation as potential agrochemical target genes/proteins, one 

30 could preferably use those cell division modulated genes/proteins which exhibit high 
expression during the G1 phase, S phase, G2 phase or M phase or at the transition stages of 
these phases. In one embodiment of the present invention, the focus may be on the G2/M 
transition phase, since this phase in the plant cell cycle is considered to have more "plant 
specific" elements than other phases of the cell cycle and is therefore more likely to yield plant 

35 specific candidate target genes and proteins. Whereas the core cell cycle genes/proteins and 
the basic regulatory mechanisms controlling cell cycle progression are conserved among 
higher eukaryotes, basic developmental differences between plants and other organisms imply 
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Further, according to a second step in the method of the invention, the candidate 
agrochemical target gene or gene product is subsequently validated as being essential for the 
growth and/or development and/or viability of the organism. This is achieved by cloning the 
identified candidate target gene in a vector construct designed to downregulate said target 
gene in a plant or plant cell, followed by inoculating the plant with this construct and monitoring 
whether downregulation of the gene results in negative effects on plant growth and/or 
development and/or viability. A valid target gene is a target gene that causes significant effects 
on growth of plants or plant cells when downregulated. The present application describes for 
the first time the use of a particularly fast and efficient downregulation method to validate 
possible agrochemical targets. 

Accordingly, the present invention encompasses a method as mentioned above for the 
identification and validation of plant targets for agrochemicals, wherein said downregulation 
involves a viral-induced gene silencing mechanism. 

* 

Thus, starting from a number of candidate target genes/proteins identified in the first step of 
the method of the invention, the target validation step aims at confirming and demonstrating 
the essential nature of the gene by demonstrating that severe down-regulation of the 
expression level of the gene has a significant effect on the organism. 

In particular, when one is interested in developing a screening assay for herbicides, 
downregulation of the candidate target gene in a plant may result in a lethal effect, a severe 
inhibition of plant growth or any other (obviously) negative phenotypic effects. Alternatively, 
when one is interested in developing a screening assay for plant growth regulators, the effect 
of downregulating the target gene may be modulation or even stimulation of growth in general 
or modulation or even stimulation of a particular process associated with plant growth and/or 
development and/or architecture and/or physiology and/or biochemistry or any other 
phenotypic effect. 

The man skilled in the art will be aware of various methods to achieve downregulation of a 
given gene or protein, such methods include essentially co-suppression based approaches or 
anti-sense based approaches as well as any other method resulting in gene silencing. Other 
examples of downregulation in a cell are well documented in the art and include, for example, 
RNAi techniques, the use of ribozymes etc. Gene silencing may also be achieved by insertion 
mutagenesis (for example, T-DNA insertion or transposon insertion) or by gene silencing 
strategies as described by, among others, Angell and Baulcombe, 1998 (WO 98/36083), Lowe 
et a/., 1989 (WO 98/53083), Lederer et ai, 1999 (WO 99/15682) or Wang et a/., 1999 (WO 
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numbers of genes/proteins. This can be achieved by using viral induces gene silencing 
mechanisms (VIGS) and by infecting the plant directly and locally, for example on the leaves. 
Therefore, according to another embodiment, the present invention relates to the use of a viral- 
induced gene silencing system for validating plant targets for agrochemicals. 

5 

This method for severe downregulation via transient expression of the gene in the presence of 
certain viral elements is referred to as "virus-induced gene silencing mechanism" (VIGS) and is 
previously described in Ratcliff et a/., Plant J., 25 237 - 245, 2001. Briefly, virus vectors 
carrying host-derived sequence inserts induce silencing of the corresponding genes/proteins in 

10 infected plants. This virus-induced gene silencing is a manifestation of an RNA-mediated 
defence mechanism that is related to post-transcriptional gene silencing in transgenic plants. 
Ratcliff „ef a/., developed an infectious cDNA clone of Tobacco rattle virus (TRV) that has been 
modified to facilitate insertion of non-viral sequences and subsequent infection in plants. This 
vector mediates VIGS of endogenous genes/proteins in the absence of virus-induced 

15 symptoms. Unlike the other RNA virus vectors that have been used previously for VIGS, the 
TRV construct is able to target most RNA's in the growing points of the plant. A more detailed 
description of this downregulation mechanism is given in example 2. 

According to particular embodiments of the present invention, the VIGS system is applied in 
20 Arabidopsis or in tobacco for the purposes of validation of a candidate agrochemical target 
gene. 

According to a further preferred embodiment, there is provided a method for validation of a 
candidate agriochemical target gene, wherein the gene is downregulated in a plant via the use 
of infectious DNA of virus is Tobacco Rattle Virus and wherein said plant is tobacco. 

25 

The present invention relates to a combination of the above-mentioned identification and 
validation steps, which are especially selected so that they lead to an efficient selection of 
candidate target genes for agrochemicals. The outcome of the transcript profiling provides the 
necessary information and forms the basis for the second step, namely the validation of the 

30 target gene via incorporation of the gene sequence in the downregulation construct. The 
combination of these two techniques is especially useful for selecting suitable target 
genes/proteins for agrochemicals in a high throughput fashion. This technique thus overcomes 
the technical limitations of previously described techniques such as the knock-out libraries and 
the antisense strategies without genetic information of the genes. This new combination offers 

35 a time-saving strategy for identification of a candidate target gene and the more direct 
information output in the form of a real sequence, the immediate cloning of the gene in the 
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Therefore, the present invention encompasses a method for screening candidate agrochemical 
compounds comprising the use of any of the methods as mentioned above. 

The invention may also be applied for the development of agrochemical (for example herbicide 
5 or pesticide) tolerant plants, plant tissues, plant seeds and plant cells. 

Herbicides that exhibit greater potency can also have greater crop phytotoxicity. A solution to 
this problem is to develop crops that are resistant or tolerant to herbicides. Crop hybrids or 
varieties that are tolerant to the herbicides allow, for instance, for the use of herbicides that kill 
weeds without attendant risk of damaging the crop. Further it should be clear that when a plant 
10 is overexpressing the target of a particular herbicide, the tolerance of said plant against said 
herbicide will also be enhanced. 

Therefore the present invention also relates to the use of the agrochemical (e.g. herbicide) 
target genes/proteins as identified by the method of the present invention for generating 
15 transgenic plants that are tolerant or resistant to an agrochemical (e.g. herbicide). Example of 
genes and gene sequences identified by the combined identification and validation methods of 
the present invention and which can be used as agrochemical target or that can be used to 
obtain herbicide tolerant plants comprise the sequences as represented in any of SEQ ID NOs 
1 to 785. 

20 These sequences are derived from tobacco, but the one skilled in the art can easily find via 
homology search in databases or homology search in a cDNA library the homologues genes of 
other plant species, for instance monocot sequences (e.g the corresponding rice or corn 
sequence), and use them for the same purposes as described herein. These homology 
searches can be done for example with a BLAST program (Altschul et a/., Nucl. Acids Res., 25 

25 3389 - 3402, 1997) on a sequence database such as the GenBank database. Homology 
studies as referred to above can be performed using sequences present in public and/or 
proprietary databases and using several bioinformatics algorithms, well known to the man 
skilled in the art. Methods for the alignment of sequences are well known in the art, such 
methods include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the algorithm of 

30 Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of two 
complete sequences that maximizes the number of matches and minimizes the number of 
gaps. The BLAST algorithm calculates percent sequence identity and performs a statistical 
analysis of the similarity between the two sequences. The software for performing BLAST 
analysis is publicly available through the National Centre for Biotechnology Information. 

35 

Further, some of the tobacco sequences identified by the method of the present invention 
might be partial but again, the full-length sequence can easily be found based on the partial 
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sequence. For example "transcript building" can be done based on homology search on EST 
databases, cDNA's or gene predictions. These databases and programs are pub.ic.y available 

e.g. http://www.tiq r.org/ 

Therefore the present invention relates to the use of the nucleic acids as identified and 
disclosed herein and represented in SEQ ID NO 1 to 785, and also to the use of the full length 
genes regenerated from the partial sequences as well as to the use of the homoiogues 
sequences isolated from the same or from other plants. 

In another embodiment, the present invention relates to a nucleic acid identified according to 
the method of the invention. Thus the invention encompasses an isolated nucleic acid 
identifiable by any of the methods as mentioned above. 

In another embodiment, the invention relates to a nucleic acid identified according to the 

method of the invention, comprising the nucleic acid sequence chosen from the group of SEQ 

ID NO 1 to 785 or a full length sequence thereof, or a functional homologue thereof or a 

functional fragment thereof, or an immunologically active fragment thereof. Thus the invention 

encompasses an isolated nucleic acid, comprising at least part of a nucleic acid sequence 

chosen from the group of SEQ ID NO 1 to 785 a homologue, functional fragment or derivative 
thereof. 

With "a functional fragment" is meant any part of the sequence that is responsible for the 
b.olog,cal function or for an aspect of the biological function of the nucleic acid sequence. 

Further, the invention encompasses a method for the production of an agrochemical resistant 
Plant, comprising the use of any one or more of SEQ ID NO 1 to 785 or a homologue 
functional fragment or derivative thereof or one or more of the proteins encoded by SEQ ID NO 
1 to 785 or a homologue, functional fragment or derivative thereof. 

In one embodiment of the present invention the sequences, the full-length sequences and the 
homoiogues are used to develop herbicide tolerant plants. 

Further the invention encompasses a plant tolerant to an agrochemical, in which the 
expression level of one or more of the nucleic acids corresponding the SEQ ID NO 1 to 785 or 
the homologue, functional fragment or derivative thereof, is modulated. Further the invention 
encompasses any part or more preferably any harvestable part of these plants. 

Therefore the invention also relates to the use of these sequences, the full-length sequences 
and the homoiogues as targets for agrochemicals The invention encompasses the use of a 
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nucleic acid as mentioned above or the protein encoded by said isolated nucleic acid as a 
target for an agrochemical compound, preferably, wherein the agrochemical compound is a 
herbicide. 

5 Further, the invention relates to the use of these sequences to develop screening assays for 
the identification and/or development of agrochemicals. The invention encompasses a method 
for screening candidate agrochemical compounds comprising the use of any one or more of 
SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative thereof or one or more 
of the proteins corresponding to SEQ ID NO 1 to 785 or a homologue, functional fragment or 
1 0 derivative thereof 

The present-invention will be further illustrated by the following figures, wherein, 
Figure 1 shows the gene expression profiles obtained by quaiity-based clustering of all 
transcript tags monitored in a transcript profiling experiment as described in example 1. Shown 
15 are the trend lines of 16 clusters containing 97% of the genes and covering the entire time 
course as indicated on top. S-phase-specific gene clusters are grouped in A, gene clusters 
with peak expression between S- and M-phase are grouped in B, whereas group C contains 
the M- and G1-phase-specific clusters. D: Three small clusters of genes with peak expression 
during two cell cycle phases. 

20 

Figure 2 shows the phenotypes of tobacco plants inoculated with a acetolactate synthase 
(SEQ ID NO 18) downregulation construct and phenotypes of tobacco plants inoculated with a 
prohibitin (SEQ ID NO 21) downregulation construct. The phenotypes were observed 12 days 
after inoculation (upper panel) or 17 days after inoculation (lower panel). 

25 

Figure 3 shows the phenotype of tobacco plants inoculated with a B-type CDK (SEQ ID NO 
11) donwregulation contruct. The observations were made 37 days after inoculation. 

Figure 4 shows the sequences identified by the methods of the present invention and 
30 represented by SEQ ID NO 1 to SEQ ID NO 785 

EXAMPLES 
Example 1 

A cDNA-AFLP based expression profiling of sequence obtained from samples of a 
35 synchronized tobacco BY2 cell line system, was used to identify genes that are upregulated 
during the cell cycle, an essential biological process needed for the viability and growth of the 
tobacco cell line system. 
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CA). Exit from S-phase was observed 4 h after aphidicolin release and the level of synchrony 
was shown to be sufficiently high throughout the time course. 



RNA extraction and cDNA synthesis. Total RNA was prepared by using LiCI precipitation 
5 (Sambrook et al., 1989) and poly(A + ) RNA was extracted from 500 jig of total RNA using 
Oligotex columns (Qiagen, Hilden, Germany) according to the manufacturer's instructions. 
Starting from 1 \ig of poly(A + ) RNA, first-strand cDNA was synthesized by reverse transcription 
with a biotinylated oligo-dT 25 primer (Genset, Paris, France) and Superscript II (Life 
Technologies, Gaithersburg, MD). Second-strand synthesis was done by strand displacement 
10 with Escherichia coli ligase (Life Technologies), DNA polymerase I (USB, Cleveland, OH) and 
RNAse-H (USB). 

cDNA-AFLP analysis. Five hundred ng of double-stranded cDNA was used for AFLP analysis 
as described (Vos et al., Nucl. Acids Res., 23 4407 - 4414, 1995; Bachem et al., Plant J., 9 745 

15 - 753, 1996) with modifications. The restriction enzymes used were BsfY\ and Mse\ (Biolabs) 
and the digestion was done in two separate steps. After the first restriction digest with one of the 
enzymes, the 3' end fragments were collected on Dyna beads (Dynal, Oslo, Norway) by means 
of their biotinylated tail, while the other fragments were washed away. After digestion with the 
second enzyme, the released restriction fragments were collected and used as templates in the 

20 subsequent AFLP steps. The adapters used were: for BsM, 5'-CTCGTAGACTGCGTAGT-3' 
and 5'-G ATCACTACGCAGTCTAC-3' , and for Mse\, 5'-GACGATGAGTCCTGAG-3' and 
5-TACTCAG G ACTCAT-3' ; the primers for BstY\ and Mse\ were 
S'-GACTGCGTAGTGATCfr/CJNvrS' and 5'- G ATGAGTCCTG AGTAAN -i . 2 -3' , respectively. For 
preamplifications, a Mse\ primer without selective nucleotides was combined with a BsM primer 

25 containing either a T or a C as 3' most nucleotide. PCR conditions were as described Vos et al., 
Nucl. Acids Res., 23 4407 - 4414, 1995). The obtained amplification mixtures were diluted 
600-fold and 5 was used for selective amplifications using a P 33 -labeled BsfYI primer and the 
Amplitaq-Gold polymerase (Roche Diagnostics, Brussels, Belgium). Amplification products were 
separated on 5% polyacrylamide gels using the Sequigel system (Biorad). Dried gels were 

30 exposed to Kodak Biomax films as well as scanned in a phospholmager (Amersham Pharmacia 
Biotech, Little Chalfont, UK). 

Quantitative measurements of the expression profiles and data analysis. Gel images 
were analyzed quantitatively with the AFLP-QuantarPro image analysis software (Keygene 
35 N.V., Wageningen, The Netherlands). This software was designed for accurate lane definition, 
fragment detection, and quantification of band intensities. All visible AFLP fragments were 
scored and individual band intensities were measured per lane. The obtained data were used to 
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determine the quantitative expression profile of each transcript. The raw data were corrected 
for d,fferences in total lane intensities, after which each individual gene expression profile was 
variance-normalized . This was done as follows. 

The obtained raw data were first corrected for differences in total lane intensities which may 
occur due to loading errors or differences in the efficiency of PGR amplification with a given 
primer combination for one or more time points. The correction factors were calculated based on 
constant bands throughout the time course. For each primer combination, a minimum of 
10 invariable bands was selected and the intensity values were summed per lane. Each of the 
summed values was divided by the maximal summed value to give the correction factors Finally 
all raw values generated by QuantarPro were divided by these correction factors. 
Subsequently, each individual gene expression profile was variance-normalized by standard 
stat.st.cal approaches as used for microarray-derived data (Tavazoie et a/., Nature Genet., 22 
281 - 285, 1999). For each transcript, the mean expression value across the time course was 
subtracted from each individual data point after which the obtained value was divided by the 
standard deviation. A coefficient of variation (CV) was calculated by dividing the standard 
dev.at.on by the mean. This CV was used to establish a cut-off value and all expression profiles 
w.th a CV less than 0.25 were considered as constitutive throughout the time course. 
The Cluster and TreeView software (Eisen et a/., PNAS, 95 14863 - 14868, 1998) was used 
for hierarchical, average linkage clustering. Quality-based clustering was done with a newly 
developed software program (De Smet et al„ Bioinformatics 2002 May; 18(5): 735-46) This 
program is related to K-means clustering, except that the number of clusters does not need to 
be defined in advance and that the expression profiles that do not fit in any cluster are 
rejected. The minimal number of tags in a cluster and the required probability of genes 
belong.ng to a cluster were set to 1 0 and 0.95, respectively. With these parameters, 86% of all 
25 the tags were grouped in 21 distinct clusters. 

Characterization of AFLP fragments. Bands corresponding to differentially expressed 
transcripts were isolated from the gel and eluted DNA was reamplified under the same 
condrtions as for selective amplification. Sequence information was obtained either by direct 
sequencing of the reamplified polymerase chain reaction product with the selective BsrYl 
pnmer or after cloning the fragments in pGEM-T easy (Promega, Madison, Wl) or sequencing 
of .ndividual clones. The obtained sequences were compared against nucleotide and protein 
sequences present in the publicly available databases by BLAST sequence alignments 
(Altschul et a/., Nucl. Acids Res., 25 3389 - 3402, 1997). When available, tag sequences were 
replaced with longer EST or isolated cDNA sequences to increase the chance of finding 
significant homology. Based on the homology, transcript tags were classified in functional 
groups as shown in Table 1 . 
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Experimental Results 

Identification and characterization of cell cycle-modulated genes 

Tobacco BY2 cells were synchronized by blocking cells in early S-phase with aphidicolin, an 
5 inhibitor of DNA polymerase a. After the inhibitor had been released, 12 time points with an 1-h 
interval were sampled, covering the cell cycle from S-phase .until M-to-G1 transition. Flow 
cytometry and determination of the mitotic index showed that the majority of cells exit S-phase 
4 h after release from blocking and that the peak of mitosis is reached at 8 h. From each time 
point, extracted mRNA was subjected to cDNA-AFLP-based transcript profiling. 

10 Quantitative temporal accumulation patterns of approximately 10,000 transcript tags were 
determined and analyzed. In total, around 1,340 transcript tags were modulated significantly 
during" the~ cell xyclerHierarchieat clustering™ of-the-expression profiles resulted in four large 
groups with the peak of expression in S-, early G2-, late G2-, or M-phase. Within each of these 
groups, several smaller clusters of genes with similar expression patterns could be 

15 distinguished. By quality-based clustering 21 different clusters were identified (see: 
http://www.plantgenetics/genomics/CCMgenes). In agreement with the hierarchical clustering, 
the four largest clusters (clusters 1 to 4 in Fig. 1) correspond to the S-, early G2-, late G2-, and 
M-phases and together contain 65% of all the tags. An additional cluster (cluster 5 in Fig. 1C), 
not clearly separated in the hierarchical clustering, includes the genes with peak expression in 

20 G1 -phase and contains another 5% of the tags. The remaining clusters are much smaller and 
most often (e.g., clusters 6, 9, 10, and 18) include genes with a narrow temporal expression 
pattern. In addition to these clusters, three small groups of genes displaying elevated 
expression during two cell cycle phases were distinguished also by quality-based clustering 
(Fig. 1 D). 

25 After the transcript tags had been sequenced, homology searches revealed that 36.5% of the 
tags were significantly homologous to genes of known functions, 13.1% of the tags matched a 
cDNA or genomic sequence without allocated function, whereas for 50.4% of the tags no 
homology with a known sequence was found. Genes of known function belong to diverse 
functional classes (Table 1) revealing that several biological processes are at least partially 

30 under temporal transcriptional control during the cell cycle in plants. In general, the observed 
transcript accumulation profiles and cell cycle specificity correlate well with the functional 
properties of the corresponding genes. It is interesting that the number of transcription factors 
with G2-phase specificity is high, which may be related with the induction of genes involved in 
M-phase-specific processes. The overrepresentation of RNA-processing genes in the M-phase 

35 might indicate that post-transcriptional regulation is involved in gene activity during mitosis. 
Because de novo transcription is severely reduced during mitosis (Gottesfeld et a/., Trends 
Bioch. Sci., 22 197 - 202, 1997). RNA-processing could provide an alternative regulatory 
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differentially during the cell cycle. The transcripts are present during late M-phase and early 
S-phase, suggesting that CDKC is active during the G1-phase. 

In addition to these well-characterized cell cycle-regulatory genes, also several tags were 
identified herein derived from genes encoding transcription factors and protein kinases or 
5 phosphatases with a known or putative role in cell cycle control. One tag with a sharp peak of 
transcript accumulation 1 h before the B- and D-type cyclins corresponds to a 3R-MYB 
transcription factor. Recently, a 3R-MYB has been shown to activate B-type cyclins and other 
genes with a so-called M-phase-specific activator domain (Ito et a/., Plant Cell, 13 1891 - 
1905, 2001). Another tag peaking in M-phase is homologous to the CCR4 associated protein 

10 CAF. CAF forms a complex with CCR4 and DBF2, resulting in a transcriptional activator 
involved in the regulation of diverse processes including cell wall integrity, methionine 
biosynthesis and M-to-G1 transition (Liu et a/., EMBO J., 16 5289 - 5298, 1997). A majority of 
the tags with similarity to protein kinases and phosphatases show M-phase-specific 
accumulation (Table 1). Although the true identity and putative cell cycle related function 

15 remains unclear for the majority, one is highly homologous to a dual-specificity phosphatase. 
This type of phosphatases plays a crucial role in cell cycle control in yeast and animals 
(Coleman and Dunphy, Curr. Opin. Cell Biol., 6 877 - 882, 1994). Another M-phase-specific 
tag is homologous to prohibitin. In the mammalian cell cycle, prohibitin represses 
E2F-mediated transcription via interaction with retinoblastoma (Rb), thereby blocking cellular 

20 proliferation (Wang et at., Oncogene, 18 3501 - 3510, 1999). 

Protein degradation by the ubiquitin-proteasome pathway also plays an important role in the 
- control of cell cycle progression at both G1-to-S transition and exit from mitosis. Although there 
is little evidence for cell cycle-modulated expression of the genes encoding the various 
components of the ubiquitin-proteasome complexes, some proteins accumulate in a cell 

25 cycle-dependent way (del Pozo and Estelle, Plant Mol. Biol., 44 123 - 128, 2000). 
Furthermore, several tags were isolated herein from genes encoding ubiquitin-conjugating 
enzyme (E3), ubiquitin-protein ligase (E2), and proteasome components with an 
M-phase-specific expression pattern. Another transcript tag that accumulates during late 
M-phase is similar to cathepsin B-like proteins, which are proteolytically active and degrade 

30 diverse nuclear proteins, including Rb (Fu et a/., FEBS Lett., 421 89 - 93, 1998). 

Whereas all the core cell cycle regulatory genes have been identified that control the G2-to-M 
transition for which the expression is known to be cell cycle modulated, genes such as Rb and 
E2F, controlling G1-to-S transition were not found. These genes were probably missed 
because the G1-to-S transition was not included in the present analysis, what is supported by 

35 the finding that the early targets of E2F, such as polymerase a and ribonucleotide reductase, 
are already present at high levels at the beginning of the time course. 
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reappears at lower levels during M-phase and a third one is M-phase-specific. This latter 
expression profile has also been described for a RNR gene from Xenopus where the encoded 
protein appears to be involved in microtubulin nucleation (Takada et a/., Mol. Cell Biol M 11 
4173-4187, 2000). 

5 Numerous other transcript tags with S-phase specificity were found in addition to the ones 
involved in DNA replication and modification. Most interestingly, one of these tags is 
homologous to a mammalian gene encoding a TRAF-interacting protein (TRIP), which is a 
component of the tumor necrosis factor (TNF) signalling complex, and promotes cell death 
when complexed with TRAF (Lee et a/., J. Exp. Medicine, 185 1275 - 1285, 1997). Another 
10 S-phase-specific tag shows homology to the RING finger domain of inhibitor of apoptosis 
proteins, which are also involved in the TNF signalling pathway. 

Modulated expression of genes required for mitosis and cytokinesis 

Several paralogous genes that encode either a- or p-tubulin were highly induced and 
15 accumulated prior to the mitotic index peak or during early M-phase. The inventors found that 
in BY2, tubulin genes are highly cell cycle modulated. This transcriptional regulation is in 
agreement with previous demonstrations of de novo transcription of a- and p-tubulin genes 
during different cellular processes (Stotz et a/., Plant Mol. Biol., 41 601 - 614, 1999). In the 
present analysis, no y-tubulin genes were found, confirming published data that the amount of 
20 y-tubulin is constant in dividing BY2 cells (Stoppin-Mellet et a/., Plant Biol., 2 290 - 296, 2000). 
Most of the kinesins identified herein, fall in the same cluster as the tubulins peaking prior to 
mitosis. Interestingly, two tags have a distinct transcription pattern and appear in another gene 
cluster. Their window of transcript accumulation is very narrow and coincides with the peak of 
mitosis. Most interestingly, these tags correspond to the plant-specific 
25 phragmoplast-associated type of kinesin, PAKRP1 (Lee and Liu, Curr. Biol., 10 797 - 800, 
2000). A chromokinesin not yet described in plants was identified as well. This type of motor 
proteins use DNA as cargo and play a role in chromosome segregation and metaphase 
alignment (Wang et a/., J. Cell Biol., 128 761 - 768, 1995). 

Among the M-phase-specific kinases, two were unambiguously recognized herein as playing a 
30 role in cytokinesis. One is Aurora, a protein kinase with a key role in the control of 
chromosome segregation, centrosome separation, and cytokinesis in yeast and animals 
(Bischoff and Plowman, Trends Cell Biol., 9 454 - 459, 1999) but not described in plants yet. 
The other is NRK1, a mitogen-activated protein kinase kinase which is phosphorylated by 
NPK1, a kinase involved in regulating the outward redistribution of phragmoplast microtubules 
35 (Nishihama et a/., Genes Dev., 15 352-363, 2001). 
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because of their putative function in various key processes crucial for cell life, their expression 
at a certain developmental stage crucial for cell life, their role in metabolism and/or 
maintenance of cell living state. 

This example illustrates the validation of these candidate genes as novel targets for 
5 agrochemicals, via the technique of the virus-induced gene silencing (VIGS). 

Gene silencing mechanism 

The virus-induced gene silencing (VIGS) is a manifestation of an RNA-mediated defence 
mechanism that is related to post-transcriptional gene silencing (PTGS) in transgenic plants 

10 (Ratcliff et a/., Plant J., 25 237 - 245, 2001). The method uses a vector with an infectious 
cDNA of tobacco rattle virus (TRV) modified (see below) to facilitate insertion of target 
sequences and modified for efficient infection of plants (e. g. tobacco). The vector mediates 
VIGS of endogenous genes in the absence of specific virus-induced symptoms. 
The RNA-mediated defence is triggered by the virus vectors, and targets both the viral genome 

15 and the host gene corresponding to the insert. As a result, the symptoms in the infected plant 
are similar to loss-of-function mutants or reduced-expression mutants in the host gene. The 
presence of a negative growth phenotype suggests that the targeted gene is a potential 
herbicide target. 

The process of constructing a virus vector and monitoring symptoms on infected plants is 
20 completed within a few weeks, such that virus-induced gene silencing (VIGS) provides a 
simple, rapid means of assigning function to genes that have been sequenced but are 
otherwise uncharacterized. The determination of new herbicide target genes is performed in a 
few weeks including gene cloning, transformation steps and tobacco plant analyses. 
The TRV construct is shown to target host RNAs in the growing points of plants (Ratcliff et a/., 
25 Plant J., 25 237 - 245, 2001) such as meristems and actively dividing cells. 

It has been shown that this vector overcomes many of the problem features of PVX, TMV and 
TGMV. For example, the TRV vector induces very mild symptoms, infects large areas of 
adjacent cells and silences gene expression in growing points such as meristems and actively 
dividing cells. Infection of tobacco plants on the leaves with TRV based constructs will affect ' 
30 growth and development of upper parts of the infected leaves and allow screening for growth 
parameters. 

Construction of TRV vectors used in the validation process of the present invention 

TRV is a positive-strand RNA virus with a bipartite genome. Proteins encoded by RNA 1 are 
35 sufficient for replication and movement within the host plant, while proteins encoded by RNA 2 
allow virion formation and nematode-mediated transmission between plants (reviewed by 
MacFarlane, J. Gen. Virol., 80 2799 - 2807,1999). 
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host cell gene, corresponding to the cloned gene of interest. All experiments involving virus- 
infected material was carried out in controlled growth chambers. N. benthamiana plants were 
germinated ad grown individually on universal potting ground in pots at 25°C during the day 
(16h) and 20°C during the night (8h). 

_» 

. 5 

The plants were phenotypically evaluated on a daily basis. Particular attention was given to 
visible leaf damage and growth inhibition. The effects of the suppression of gene activity using 
the VIGS system is measured by the phenotypic aspect of the plants, including leaf defects 
such as growth retardation, yellow or necrotic spots, early senescence, etc. The effects of the 
10 downregulation of genes identified by the methods of the invention are also measured on the 
flower structure and the flowering capacities of the transformed plants. 

The severity of the phenotype is linked to the level of suppression of the geneactivity and 
indicates the degree in which the gene is essential for the plant Therefor the phenotype is an 
indication of the degree in which the gene is a valid target for a herbicide. 

15 

Phenotypes of the infected plants. 

1. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces leaf growth modification, including leaf wrinkling, curling, wilting, 
leading to cell death and/or plant death. 

20 

2. Co-suppression of the geneleads to loss of gene transcription and protein expression in the 
virus infected leaf and induces leaf yellowing or senescence, or cell deqth and necrosis, 
leading to plant death. 

25 3. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following phenotypic symptoms: chlorotic regions 
around infection, crisp or crunchy leaf texture around infection, numerous surface lumps on 
either leaf surface, abnormal trichomes, abnormal leaf size, reduced growth, reduced final 
size, altered vascular leaf system, altered water movement in leaf , leading to cell death and/or 

30 plant death. 

4. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following anatomical symptoms: clumps of modified 
cells on the surface of the leaf (either abaxial or adaxial), individual cells detached from the 
35 epidermis, swollen or modified trichome cells, modification of leaf tissue structure, cell size, cell 
number, tissue composition, parenchyme, epidermis, etc , leading to cell death and/or plant 
death. 
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Table 1. Functional classification of transcript tags 



Function 



Tags S 

27.7% 



G2 

1 5.8% 



M 

52.9% 



G1 
3.6% 



Cell cycle control 


30 


5/8 (0.078) 


8/5 (0.068) 


14/16(0.114) 


3/1 


Cell wall 


35 


6/10 (0.047) 


4/6(0.136) 


25/18 (7.1 e -3 ) 


0/1 


Cytoskeleton 


43 


1/12 (1.26- 5 ) 


4/7 (0.090) 


38/22 (2.1 e" 7 ) 


0/2 


Hormone response 


13 


6/4 (0.113) 


1/2 (0.277) 


6/7 (0.185) 


0/0 


Kinases/Dhosohatases 1 


27 


4/8 (0.039) 


1/4 (0.059) 


19/14(0.025) 


3/1 


Protein synthesis 


50 


15/14(0.116) 


5/8 (0.087) 


29/26 (0.079) 


1/2 


Proteolysis 


21 


2/6 (0.026) 


1/3(0.144) 


1 7/1 1 (0.039) 


1/1 


Replication and modification 


74 


57/20 (4.2e' 19 )8/12(1.0e -5 ) 


8/39 (1.0e" 18 ) 


1/3 


RNA processing 


20 


1/6 (6.8e-3) 


1/3 (0.137) 


18/11 (8.1 e 4 ) 


0/0 


Signal transduction 


10 


1/3 (0.121) 


3/2(0.201) 


6/5 (0.205) 


0/0 


Stress response 


20 


6/6(0.192) 


2/3 (0.229) 


10/10(0.159) 


2/1 


Transcription factors 


27 


4/8 (0.039) 


10/4 (3-Oe- 3 ) 


12/14 (0.112) 


1/1 


Transport and secretion 


31 


5/9 (0.047) 


2/5 (0.076) 


21/16 (0.031) 


3/1 


Unknown 


175 


37/48 (0.015) 


19/28 (0.014) 


112/93 {Z.Ze*) 


7/6 



The total number of tags and the observed/expected number of tags within the different cell 
cycle phases for each functional group is given together with the probability values between 
parentheses as calculated based on the binomial distribution function, except for the G1 -phase 
because the values were too small. A significant enrichment (P<e 3 ) of tags of a functional 
group within a particular cell cycle phase is indicated in bold. 

1 Only kinases and phosphatases with unknown biological function. 

2 Except small GTP-binding proteins, which are classified under signal transduction. 

Table 2: overview of group 1 of sequences used for validation of candidate target genes 



SEQID NO | 


CDS NO 


Tag Name 


Function 


hase 


1 


2216 


18R1850 C4-32-33 1E2 


catalase 


* • 


2 


2217 


Bstt2-31-215 


phytoene desasturase 


• » 


3 


2218 


Bstc13-1-145 


L-ascorbate peroxidase 


VI-G1 


4 


2219 


Bstc21 -4-280 


GTP-bindinqprotein 


M 


5 


£220 


Bstc33-2-310 


vacuolarsortingreceptor . _ 


M 




2221 


Bstc4-34-170 


probable cinnamyl alcohol dehydrogenase 


G1/S-S: M-G1 


7 


2222 


Bstt34-3-470 


kinesin , . 


M 


8 


2223 


Bsttl 2-3-410 


B-typeCDK . _ 


M 


9 


2224 


Bsttl 4-3-458 


squalene mono-oxygenase 


G1/S-S 


10 


12225 


Bsttl 2-1 -230 


kinesin-iikeprotein 


M 


11 


2226 


Bstt23-4-230 


B-typeCDK 


M 


12 


|2227 


Bstt2-42-225 


B-typeCDK 


M 


13 


|2228 


Bstt31 -4-208 


arabinogalactan protein precursor 


G2/M-M 


14 


|2229 


Bstt 3-41-205 


larabinogaiactan protein precursor 


G2/M-M 
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BY?-AAnfiQi in ^Jr^~ w*m pnospnatase 2A reaulatorvph^T 



0624 



0626 
0626 



IS 



— - - i a uauuill, ^/pp — 

RY?.AAn7orm.nnn^ . : . . ' ™' 



■ - — w»_ w/ / . i nou l I / DO Is 



0629 



nnmnlomnn4nl, ~~T^r^: — — ' 



h^iu7I7r~r^ ■ v ^D^DDoy. i AL3b0314, N. 

feS^ 

^o_yW4 BY2- AA095M08 nrntp n L- i n ^ r> ^ i 1 ~ — : — , 



44 

_ 

45 



0635 



46 



47 



48 



0636 



imE y ° AG Pf ° tein Arafrdopsis BAA970637TAP00 0370 N tabacum 

Id AADQ1 ri'tfinOt cnc m _■: ! . 



0637 



0638^1 



49 



0638 2 




f t. •;> A a i i .i i 71 r v ' la pacum 89 1 hp ■ — . 

^^^^^^^^^^^ 



3Y2-AA04iPn/i ^r,;rri-™^-,,r; 



Ni cotiana tabacum? '^o_i (AF1 13545) vacuole-associated annexfn VCaB42 

BY2-AA044CU2 >dbj)BAA02Q2fi 1 1 /nnyi7m n — 

BY2 ^AA044L14 dbilBAA 9731oT t /adh^- — . 

PR OTEIN AKCS9 _ DO ' P ^ V, GUN PROBABLfc NONSPECIFIC LiPJD-TRANSFER 

BY2-AAOidRnnRr ^QCm ~i u.'in/, ■' V — ■ 



PROTEIN AKCS9 

*~ aSSdUAK0l6659^ p utative \Mus musculus 



29 



WO 03/085115 PCT/EP03/03703 



fin 


JODO 


RY9-AAn4fipnfi nirliT^n^Rfi stamina nicitiNniriia nrotein StD nmDOrtecn - aarden pea 


□ I 




BY2-AA046G14 dbj|BAB26082.1 (AK009117) putative [Mus musculus] 




L/vj r U 


BY2-AA046H23 emb|CAA98172.1| (Z73944) RAB8A fLotus japonicus] 


63 


0671 


BY2AA048A05 qbjAADI 5504.1 1 (AC006439) putativeAAA-type ATPase [Arabidopsis thalianal 




UO / c. 


BY2-AA049K03 dbj 


BAB24909.1 (AK007240) putative [Mus musculus] 


DO 


UO/ O 


BY2-AA051A10 dbj 


BAB02543.1 (AP000417) mitotic checkpoint protein [Arabidopsis thaliana] 


bo 




BY2-AA051L22p19 
PF00400 -WD doms 


•!03gb|AAD48948.1|AF147262_11 (AF147262) contains similarity to Pfam family 
»n 


D/ 


UD/0 


BY2-AA052E10 >qb AAF52905.1) (AE003628) CG4968 gene product [Drosophila melanogaster] 


CO 

bo 


nc7c 
UD/O 


BY2-AA052F14 >gb|AAF79819.1 |AC007396 20 (AC007396) T4012.22 fArabidopsis thalianal 






BY2-AA052G16p19D04 >dbj|BAB09843.1| (AB005246) geneJd:MUP24.12~unknown protein 
[Arabidopsis thalianal 


70 


0678 


BY2-AA052N17>gt 


> AAG42914.1|AF327533 1 (AF327533) unknown protein [Arabidopsis thaliana] 


n 


uo/y 1 


BY2-AA053C11.1 >< 


dbilBAB22857.1| (AK003561) putative [Mus musculus] 


70 


n.£57Q 

uo/y 


2 BY2-AA053C11.2 >qb|AAC62883.1| (AC005397) hypothetical protein [Arabidopsis thaliana] 


/ 0 


ubou 


B Y2-AA062A09 >gb|AAF01 061 . 1 |AF 1 89284_1 (AF189284) nucleolar G-protein NOG1 [Trypanosoma 
bruceil 


7A 


UDO 1 


BY2-AA062G03 >pir||T02135 hypothetical protein F8K4.10- Arabidopsis thaliana 


7c 




BY2-AA065E08 >pir|lT00795 hypothetical protein F24L7.13 - Arabidopsis thaliana 


76 


0683 


BY2-AA072K18 >emb|CAB40381 .1| (AJ010819) GrpE protein [Arabidopsis thaliana] 


"7*7 

77 


ubo4 


BY2-AA075K12 >gb|AAD31331 .1 IAC007354 4 (AC007354) T16B5.4 [Arabidopsis thaliana] 


78 


Uboo 


BY2-AA076N08 >dbj|BAA94770.1| (AP001859) ESTs AU082761 (S5084) D42006 


79 


0686 


BY2-AA080D01 >gb|AAF80646.1|AC01 2190_2 (AC012190) Contains similarity to F28016.19 a putative 
translation initiation protein 


80 1 


Ubo/ 


BY2-AA081P14 >gb|AAD32777.1|AC007661 14 (AC007661) unknown protein [Arabidopsis thaliana 


81 


0688 


BY2-AA082H04p21 F02 >dbj|BAB10171.1| (AB016880) genejd:MTG10.12~pir||T05795~strong similarity 
to unknown 


82 


0689 


BY2-AA082H06p21G04 >pir||T09039 hypothetical protein F26K10.110 - Arabidopsis thaliana 


83 


0690 


BY2-AA082M07p21B05 >dbj|BAB01 783.1) (AB022215) gene_id:MCB17.19~unknown protein 
[Arabidopsis thaliana] 


84 


0691 


BY2-AA083B24p21C04 >dbjlBAB08247.1 1 (AB006698) genejd:MCL19.6~unknown protein [Arabidopsis 
thanliana) 


85 


0692 


BY2-AA083C05p21D02 >gb|AAH02924.1|AAH02924 (BC002924) Unknown (protein for 
IMAGE:3956179) [Homo sapiens] 


86 


0693 


BY2-AA085D08p21C05 >pir||T47624 hypothetical protein T5N23.10 - Arabidopsis thaliana 


87 


0694 


6Y2-AA085F09p21H01 >qb|AAF79503.1|AC002328 11 (AC002328) F20N2. 15 [Arabidopsis thaliana] 


88 


0695 


BY2-AA085M15p21D04 >gb|AAF97305.1 |AC007843_8 (AC007843) Unknown protein [Arabidopsis 
thaliana] 


89 


0696 


BY2-AA088K23p21G05 >gb|AAG52001.1 |AC012563_1 1 (AC012563) unknown protein; 64612-65506 
[Arabidopsis thaliana] 


90 


0697 


BY2-AA088L24p21A07 >gb)AAD55292.1|AC008263_23 (AC008263) Contains PF|00249 Myb-like DNA- 
bindinq domain. 


Q»l 

a 1 


uoyo 


BY2-AA089F12p21H05 >gb|AAD55274.1|AC008263_5 (AC008263) Strong similarity to gb|D21805 
calcium-dependent protein kinase 


DO 


neon 

ubyy 


BY2-AA089M17 >pir||T02186 hypothetical protein F14M4. 16 - Arabidopsis thaliana 


93 


0700 


BY2-AA090J23p21G08 >pir||T48545 hypothetical protein F14F18.30 - Arabidopsis thaliana 


94 


0701 


BY2-AA092F12p21H06 >emb|CAB46854.1 1 (AJ388555) hypothetical protein [Canis familiaris] 


95 


0702 


BY2-AA092L20p21E07 >qb|AAD1 0646.1 1 (AC005223) 45643 [Arabidopsis thaliana] 


yb 


0703 


BY2-AA093J23p21C1 1 >gb|AAG51461.1|AC069160_7 (AC069160) unknown protein lAraoiaopsis 
thaliana] 


Q7 

y / 


C\~7C\A 


BY2-AA093L18p21D09 >emb|CAC1 5504.1 1 (AJ297917) B2-type cyclin dependent kinase [Lycopersicon 


yo 


U/UD 


BY2-AA093M19 >qb|AAG1 2535.1 (AC01 5446 16 (AC015446) Unknown protein [Arabidopsis thaliana] 


99 


0706 


BY2-AA094B12p21F10 >dbj|BAB02118.1| (AP000381) contains similarity to unknown 


1 r\r\ 

luu 


0707 1 


BY2-AA096G05p21A11 dbj|BAB02118.1| (AP000381) contains similarity to unknown 


101 


0707 2 


lei AA094B12p21F10 


102 


0708 


BY2-AA097G22p21D10 >gb|AAG60065.1 |AF 33791 3__1 (AF337913) unknown protein [Arabidopsis 
thaliana 


103 


0709 


BY2-AA099F04 gb|AAG52457.1 |AC010852_14 (AC010852) hypothetical protein; 12785-11538 
[Arabidopsis thaliana] 


104 


0710 


BY2-AA099N08p21H09 qb|AAK14411.1|AC087851 3 (AC087851) unknown protein [Oryza sativaj 


105 


0711 


Id AA100B09 ref]NP 009820.11 Ybr261cp fSaccharomyces cerevisiae] 


106 


0712 


BY2-AA109N02 ref|NP_002848.1| peroxisomal farnesylated protein; Housekeeping gene o3kD [Homo 
sapiens . . 


107 


0713 


BY2-AA1 14E09p22F02 pir||T51434 hypothetical protein F2G14 10 - Arabidopsis thaliana 


108 


0714 


BY2-AA115B14p22C02 dbj|BAB08888.1| (AB012243) gene_id:MIJ24.6-ref|NP_013897.1~similar to 
unknown protein 
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109 



110 



11 



112 



0715 



PCT/EP03/03703 



0716 



0717 



SS^SSSs?' 1 ^ 2 ^™ 3900 - 1 ,MH03900 < BC003900 > Similar * h ^e<ical protein 



0718 



gg^K j H HKu I LASOMt SUBUN.T ALPHA TYPE2(20S 




faction ° VerVieW ° f 9r ° UP 3 se£ * uences *at show homology with proteins of unknown 




124 [Bstcl 1-3-1 90 



125 
126 



27 



128 



129 



131 



Bstc11-3-215 
Bstd 1-3-230 



unknownprotein 



mtafiveprotein 



G2/M-M-G1 



M-G1 



G2/M-M-G1 



G1/S; M-G1 



h3g_ Bstc2-21-182 unknown 



j137 Bstc22-1-275 



138 Bstc2-22-100 



139 Bstc2-22-155 



146 



147 



Bstc14-3-165 
Bstcl-43-250 



Bstc1-4 3-310 



Bstc2-22-240 
Bstc22-2-270 



■142 Bstc2-23-135 



1£3_ B_stc2-23-220 un k no wn 



unknown 
unknown 



M-G1 
G2/M-M-G1 



hypotheticalprotein G2/M-M -G1 



unknownprotein 
unknown 



145 Bstc2-31-?ftn 



14jLBstc22-4-215 hypot heticalprotei n 1G2/M-M 

1 A £l D r^i^n /"i A r\r\r^ 1 — 



148 



Bstc23-2-33Q 
Bstc23-2-370 



149 



unknown 



G2/M-M 



M-G1 



G2-M-G1 



G2-G2/M 



32-M 



G2/S-G2-M 



G2-M-G1 



Bstc2-32-400 



150 



151 



152 



153 



154 



Bstc23-3-270 



>utativeprotein 



Bstc2-33-280 



G2/M-M-G1 



M 



M 

G1/S-S; 
G2/M-M-G1 



G1/S-S; 
G2/M-M-G1 



Bstc2-34-120 
Bstc23-4-300 



Bsfc2-41-165 



unknown 



155 Bstc2-42-1QQ 



156 



157 



158 



159 



Bstc2-43-21 0 
Bstc31-185 



Bstc3~12-145 



G1/S-S; M-G1 



unknownprotein bl/S-S;M-G1 



unknown 



G2/M-M-G1 



unknown 



G1/S-S 



M 



G1/S-S 



unknown 



Bstc3-1 2-290 



unknown 



unknown 



M-G1 



G2/M-M-G1 



S-G2 



G2/M-M-G1 



SE( 

ID 

NO 


3 Tag name anc 


i Function 


Fase 


160 


Bstc31-3-400 


unknown 


G2/M-M-G1 


161 


Bstc32-1-122 


unknown 


M-G1 


162 


Bstc3-2 1-125 




G1/S-S; 
G2/M-M-G1 


163 


Bstc32-2-150 


putativeprotein 


G1/S-S; 
G2/M-M-G1 


164 


Bstc32-4-193 






165 


Bstc32-4-370 




G1/S-S-G2/S; 
M-G1 


Q66 


Bstc3-3 1-350 


putativeprotein 


G1/S-S-G2/S 


R67 


Bstc33-2-145 


hypotheticalprotein 


G1/S-S; 
G2/M-M-G1 


168 


Bstc3-33-350 




G1/S-S 


169 


Bstc33-360 


putativeprotein 


G2/M-M-G1 


170 


Bstc33-4-270 


unknown 


G2/M-M 


171 


Bstc3-4 1-270 


unknown 


M-G1 


172 


Bstc3-41-300 




G2/M-M-G1 


173 


Bstc3-4 1-360 




G2/M-M-G1 


1 /4 


Bstc3-42-1 75 




M-G1 


175 


Bstc3-43-135 




G1 


176 


Bstc3-43-180 




M-G1 


177 


Bstc3-43-193 


unknown 


G1/S-S; 
G2/M-M-G1 


178 


Bstc3-4 3-287 




G1/S-S 


179 Bstc3-44-145 




M-G1 


180 I 


Bstc3-44-375 i 


Dutativeprotein 


V1-G1 


181 l 


3stc4-1 1-120 I 


lypotheticalprotein 


32/M-M-G1 


182 I 


3stc4-1 1-320 i 


jnknown j 


VJ-G1 


183 I 


Bstc42-3-115 i 


jnknown i 


M-G1 


184 I 


3stc42-3-125 j 


)utativeprotein ( 


32/M-M-G1 


185 E 


3stc4-23-210 


P 


I/1-G1 


186 E 


3stc42-4-225 l 


/nknown I 


31/S-S-G2 


187 E 


3stc4-32-115 l 


inknownprotein C 


31/S-S; 
32/M-M-G1 


188 E 


3stc4-32-185 i 


nknown c 


51/S-S 


189 £ 


Jstc4-32-190 u 


nknown c 


52/M-M 


190 E 


$Stc4-32-270 u 


nknown c 


S2/S-G2-M 


191 E 


lstc4-32-410 


C 
G 


51/S-S-G2- 
J2/M 


192 B 


stc4-34-250 


G 


I2/M-M-G1 


193 B 


sicA-4 1-230 p 


utativeprotein G 


I2/M-M-G1 


194 B 


Stc4-43-113 u 


nknown N 


I-G1 


195 B 


stc44-3-125 


G 


2/M-M 



31 
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196 


Bsttl -12-340 


unknown 


G2/M-M 


240 


Bstt3-1 1-205 




G1/S-S-G2 


197 


Bsttl 2-2-225 




G1/S-S-G2 


241 


Bstt31 -1-250 


hypotheticalprotein 


G2/M-M-G1 


198 


Bsttl -22-330 


unknown 


G2/M-M-G1 


242 


Bstt31-1-430 


hypotheticalprotein 


G2/M-M-G1 


199 


Bsttl 2-2-420 


unknownprotein 


G2/M-M-G1 


243 


Bstt3-1 2-360 


unknownprotein 


G2/M-M 


200 


Bsttl 2-2-540 


hypotheticalprotein 


G2/M-M-G1 


244 


Bstt31-3-380 




G1/S-S 


201 


Bsttl -23-1 55 




M-G1 


245 


Bstt31-4-420 


hypotheticalprotein 


G2/M-M-G1 


202 


Bsttl 2-3-21 5 


hypotheticalprotein 


G2/M-M-G1 


246 


Bstt32-1 80 


putativeprotein 


G2-M-G1 


203 


Bsttl 2-3-280 


unknown 


G1/S-S-G2 


247 


Bstt3-22-160 


PotatoEST/hypothet 


G1/S-S-G2 


204 


Bsttl 2-3-3 10 


hypotheticalprotein 


G1/S-S 






icalprotein 




205 


Bsttl 2-3-350 




G1/S-S-G2- 


248 


Bstt32-3-175 


unknown 


G2/M-M 








G2/M 


249 


Bstt32-3-325 ' 


unknown protein 


G2/M-M-G1 


206 


Bsttl -24-205 




G2/M-M-G1 


250 


Bstt3-24-135 


unknown 


G2/M-M-G1 


207 


Bsttl -24-220 




G1/S-S-G2 


251 


Bstt3-24-200 




G2/M-M-G1 


208 


Bsttl -31 -170 


hypotheticalprotein 


G2/M-M-G1 


252 


Bstt3-31-215 


unknownprotein 


G2/M-M-G1 


209 


Bstt1-31-215 


unknown 


G2/M-M-G1 


253 


Bstt3-31-330 


unknown 


G1/S-S-G2 


210 


Bsttl 3-210 


unknown 


G2/M-M-G1 


254 


Bstt33-1-350 


unknown 


G2/M-M-G1 


211 


Bsttl 4-4-3 10 


unknownprotein 


G2/M-M-G1 


255 


Bstt33-1-510 


putativeprotein 


G2/M-M-G1 


212 


Bstt2-1 1-165 


unknown 


G2/M-M-G1 


256 


Bstt33-3-220 


unknown 


G2/M-M-G1 


213 


Bstt2-12-190 




G1/S-S-G2 


257 


Bstt33-3-245 


unknownprotein 


G2/M-M-G1 


214 


Bstt21-4-150 


hypotheticalprotein 


G1/S-S-G2/S 


258 


Bstt3-33-550 


hypotheticalprotein 


G1/S-S; M-G1 


215 


Bstt2 1-4-250 - 




Gi/s-sr 


259 


Bstt33-4-T40~~' 


putativeprotein 


S-G2 








G2/M-G1 


260 


Bstt34-2-165 


unknown 


G1/S-S-G2 


216 


Bstt21-4-470 




G2/M-M-G1 


261 


Bstt3-42-325 


hypotheticalprotein 


G2/M-M-G1 


217 


Bstt22-1-170 


unknown 


S-G2 


262 


Bstt3-44-150 


unknown 


G2/M-M-G1 


218 


Bstt2-21-190 


unknown 


G2/M-M 


263 


Bstt3-44-250 


unknown 


G2/M-M-G1 


219 


Bstt22-2-190 


unknown 


G2/M-M-G1 


264 


Bstt34-4-310 


unknown 


G2/M-M-G1 


220 


Bstt22-2-290 


hypotheticalprotein 


G2/M-M-G1 


265 


Bstt3-44-345 


hypotheticalprotein 


G2/M-M-G1 


221 


Bstt22-3-225 




M 


266 


Bstt41 -2-340 




G2/M-M-G1 


222 


Bstt22-3-275 


unknown 


G2/M-M 


267 


Bstt41 -3-310 


unknown 


G2/M-M 


223 


Bstt22-3-315 


TomatoEST 


G2/M-M-G1 


268 


Bstt4-21-185 




M-G1 


224 


Bstt22-3-370 


unknown 


G2/M-M-G1 


269 


Bstt42-1-370 




S-G2-G2/M 


225 


Bstt22-3-390 


putativeprotein 


G2/M-M-G1 


270 


Bstt4-23-480 


unknown 


G2/M-M-G1 


226 


Bstt22-3-480 




G2/M-M-G1 


271 


Bstt4-24-170 




G2/M-M-G1 


227 


Bstt23-1-140 




S-G2-G2/M 


272 


Bstt43-265 


unknown 


G1/S-S-G2/M 


228 


Bstt23-120 


unknownprotein 


G2/M-M-G1 


273 


Bstt43-3-350 


unknown 


G2/M-M-G1 


229 


BSU23-1-200 




S-G2-M 


274 


Bstt4-33-390 


hypotheticalprotein 


G1/S-S;G2/M- 


230 


Bstt2-31-300 


unknown 


S 






M-G1 


231 


Bstt2-32-220 




M 


275 


Bstt4-34-280 




G2/M-M-G1 


232 


Bstt2-32-400 


hypotheticalprotein 


G2/M-M-G1 


276 


Bstt43-4-300 


unknownprotein 


G2/M-M-G1 


233 


Bstt23-3-350 


unknown 


G2-M 


277 


Bstt43-4-330 


unknownprotein 


G2/M-M-G1 


234 


Bstt23-370 


unknown 


G2/M-M-G1 


278 


Bstt43-4-340 




G2/M-M-G1 


235 


Bstt24-1-320 




S-G2 


279 


Bstt44-4-250 


hypotheticalprotein 


G2/M-M 


236 


Bstt24-2-310 




G2/M-M-G1 


280 


Bstt4-44-400 


hypotheticalprotein 


G2/M-M-G1 


237 


Bstt2-43-210 


unknown 


G2-M 


281 


MBC03-90 


unknown 


S-G2 


238 


Bstt2-4 3-240 




S-G2/S 


282 


MBC42-180 


unknown 


G2-M-G1 


239 


Bstt31-1-100 


hypotheticalprotein 


G1/S-S-G2 


283 


MBc43-210 


unknown 


G1/S-S-G2 



Table 5: overview group 4 sequences sho wi ng no homology to known genes 



SEQ 
ID 
NO 


Tag name 


Function 


Fase 


284 \ 


9stc1 1-100 


unknown 


G2/S-G2-M 


285 [ 


5stc1 -11-110 


unknown 


S 


286 &stc1 -11-115 


unknown 1 


G1/S-S;G2/M-M-G1 


287 I 


3stc1 -11-120 




G1/S-S-G2 


288 i 


5stc1 1-1-125 


unknown 


G2/M-M-G1 


289 6stc1 1-1-290 


NaD 


G1/S;G2/M-M-G1 


290 Bstd -12-155 




G2/S-G2-M 


'291 


Bstd -12-175 


unknown 


S 


292 I 


Bstd -12-185 


unknown 


G2/M-M-G1 


293 fcstd 1-3-116 


unknown 


S-G2 


294 $std 1-3-118 


unknown 


G2/M-M-G1 


295 fi 


3stc1 -13-120 




S 


(296 8 


Bstd -13-130 




G1/S-S; G2/M-M-G1 


297 fcstd -13-132 


unknown 


M-G1 



SEQ 
ID 
NO 


Tag name 


Function 


Fase 


298 I 


3stc1 -13-142 


unknown 


G1/S-S 


299 { 


Bstc 11-3-187 


unknown 


S-G2/S 


300 i 


Bstd 1-3-200 


unknown 


G1/S-S-G2/S 


301 f 


Bstd 1-3-290 


unknown 


G2/S-G2-M-G1 


302 \ 


Bstd -14-100 


unknown 


G2/M-M 


303 1 


3std -14-108 


unknown 


G2/M-M-G1 


304 


3std 1-4-130 


unknown 


G1/S-S-G2 


305 1 


Bstd 1-4-135 


unknown 


G2/M-M-G1 


306 I 


3std 1-4-140 


unknown 


S-G2-M 


307 Bstd -14-155 




|G2/M-M 


[308 fi 


3std -14-165 




G2-G2/M 


309 


3stc1 -14-167 




G2-G2/M 


310 t 


bstd 1-4-175 




G2/M-M-G1 


311 $std 1-4-200 


unknown 


IG1/S-S 



32 



WO 03/085115 



1312 festcl 2-1-110 



313 festd -21-150 



314 festd 2-1-160 



315 festd 2-1-240 



316 gstd 2-1-95 



317_|stc1 -22-110 



318 festcl 2-3-103 



319 festd 2-3-125 



320 festcl 2-3-235 



321 ftstd 2-3-237 



322Bstc1 2-4-130 



323 Bstcl 2-4-133 



324 festcl 2-4-145 



325 



istd 2-4-235 



326J3stc13-1-150 



unknown |S-G2 



unknown G2/M-M.m 



unknown G2-M-G 1 



unknown M-G1 



unknown G1/S-S-G 2" 
1G2-M-G1 



unknown G?/M-M,^T 



unknown 1G1/S-S: G1 



unknown 



unknown 



unknown 



unknown 



M-G1 



G1/S-S 



stc22-1-98 (unknown 



378 Bstc2 2-2-110 

379 gstc2 -22-160 



PCT/EP03/03703 
-G2-G2/M 



382 gstc2 -23-110 



unknown G2/M-M-G1 
'unknown |gi/S-S; G2-G2/M 



IS; G2-M 



G2/M-M-G1 



327 Bstcl 3-2-170 



328 Bstd 3-2-180 
329Bstc1 3-2^190 



nknown 



S-G2 
M-G1 



383 Bstc2 -23-140 

384 Bstc2 2-3-150 

385 1 3 stc2-23~175 



unknown 



G2/M-M-G1 



M-G1 



unknown 
unknown 



G2/M-M-G1 
G1/S-S 
G1/S-S 



333 festd -41-1 RO 



336 festd 4-2-140 



334 Bstd -41-210 



335 festd -41-230 



unknown 



unknown 



'unknown 



337 fesicl -42-150 lunknown" 



338 



G1/S-S; G2/M-M-G1 



G1/S-S 



G1/S; G2/M-M-G1 



M-G1 



339j^stc1 -42~90 unknown 



G2/S-G2 



std -42-80 unknown G 1 /S-S-G2 



340 festd -43-105 



341 festd 4-3-105 
tstd -43-110 
stcl 4-3-130 l unknown" 
344 fesici -43-140 unknowTT 



G2-M 



387 Bsfc2 2-3-90 

388 gstc2 -24^00 



_ 391 festc2 -24-170 imknnumU^c; g 



393 Bstc2 -31-160 

394 Bstc2 -31-170 



G2/M-M 



G1/S-S; G2/M- M 

=-S; G2-M 
G2/M-M-G1 



[stcl -43-150 
tetcl -43-1 75 



S-G2 



350 



std -43-185 unknown 



348J3stc1 4-3-235 unknown 



349J3sici 4-3-260 lunknowTi 



G2/M -M-G1 
S-G2 



G1/S-S-G2/S 



G1/S-S 



G2/M-M-G1 



std -43-65 unknown Gl/S-S-re? 




356 Bstc2 1-1-100 



351 festcl -43-75 unknown 



352 festd -44-138 



353 Bstd -44-140 



unknown 



unknown 



3^L|§tc2^^ 



358 Bstc2 1-1-145 

359 Bstc2 1-1-65 



3§6]festc2 1-2-120 



unknown 



G1/S-S-G2/S 



G2/S-G2-M 



G2/M-M-G1 



unknown 



G1/S-S-G2 



36lfestc2 1-2-215 



362 festc2 1-2-75 



3 g3festc2 -13-110 



370 pstc21"4-155 



3Zlfestc2-l4-160' 



372_festc2 1-4-1 80 



unknown 



M-G1 



G2-M-G1 



G2/M-M 



G1/S-S;G2/M-M 




unknown 



Z3Bstc2 2-1 00 



3Z±Pstc2_ -21-120 



unknown 



G2/M-M-G1 
'M-G1 ~ 



G2/S-G2 



unknown |G2-M 



unknown 
unknown 
unknown 



400 festc2 -33-200 



unknown f G2/M- M 

I-G1 



>-G2 
IM-G1 



unknown M-G1 
M-G1 

unknown IG2/M-M-GT 



[G1/S-S-G2-M 



JG2/M-M 



396 festc2 -32-285 



398 Bstc2 3-2-80 

399 fcstc2 3-3-175 



M-G1 

unknown IM-G1 



G1/STS;G2 ^1-M-G1" ^^f^T^^T^ 

fe 1/ e;q J U — I 3-2-135 unknown lG2/M-iVU?l 



G27M-M j 

unknown G1/S; G2/M-M-G1_J 



unknown G2/M-M" 



G2 



401 fes tc23-3- 

402 Pstc2 -33^85 



403 Bstc2 -33-95 

404 festc2 3-4-110 



406 Bstc2 3-4-310 



08 Bstc2 -41-110 
409 Bstc2 4-2-165 



unknown 



M-G1 
S-G2 



unknown 
unknown 



G2/M- M-G1 

IG2-M" 



stc2 3-4-120 unknown G1/S-S-G? 



G2-M-G1 



410 Bstc2 -43-105 i.nknn Wn 



unknown S-G2 



M-G1 



414 festc2 -43-90 unknown G2/M-M-fti 



i!5festc24^-125 unknown 



416 festc2 -44-175 



412 festc2 4-3-285 



G2/M-M 



G1 



413 Bstc2 -43-77 unknown G2/M-M-G 1 



417 festc2 4-4-220 



418 Bstc2 4-4-230 

419 Bstc2 -44-95 



424 festc3 -12-115" 



425 Bstc3 1-2-190 

426 Pstc3 1-3-127 



422 



420 Bstc3 1-110 



421 festc3 1-1-250 



stc3 1-1-77 



423 gstc3 1-1 -90 



4 27 Bstc3 1 -3-235 
428J Bstc3 -1 3-330 
429 festc3 1-3-60 
430 Bstc3 1-3-80 



433 Bstc3 -14-105 



431 festc3 -13-90 



unknown 



unknown 



unknown 



G1/S-S 



G2/M-M-G1 



G2/M-M-G1 



G2-G2 /M 



G1/S-S 



unknown M-G1 



unknown M-G1 



unknown 
unknown 



G2/M-M 



M-G1 



G1/S-S-G2 
G1/S-S-G2/M 



unknown 
nknown 



432Bstc3 -13-95 



434 Bstc3 -14-110 
435Bstc3 -14-125 



unknown 



unknown 



unknown 



unknown 
unknown 



436 Bstc3 -14-130 unknown 




unknown 



S^32 
G1_ 

G2-M 

S-G2-M-G1 



G2/M-M-G1 



M-G1 



M-G1 



M-G1 
G2/M-M-G1 



i1/S; M-G1 



G1/S-S-G2 



40fe stc3 2-2-100 [unknown 1G1/ S-S-G2 
tll_festc3 2-2-270 



M-G1 



G1/S; G2/M-M-G1 



WO 03/085115 



442 festc3 2-2-390 


unknown 


G2/M-M-G1 


443 


Bstc3 2-2-93 


unknown 


G2/M-M 


444 


pstc3 2-3-100 


unknown 


S-G2 


445 8stc3 -23-125 


unknown 


G2/M-M-G1 


446 festc3 2-3-155 




S-G2-M 


447 pstc3 -23-175 


unknown 


G2/M-M-G1 


448 


Bstc3 -23-177 




G2/S-G2-M-G1 


449 


Bstc3 2-3-63 


unknown 


S-G2 


450 


Pstc3 -23-65 




S; G2-M-G1 


451 6stc3 -24-155 


unknown 


G2/M-M-G1 


452 


Bstc3 2-4-230 


unknown 


G2/M-M 


453 


Bstc3 2-4-250 


unknown 


G2/M-M-G1 


454 


Bstc3 -24-255 


unknown 


G2/M-M-G1 


455 


Bstc3 -24-305 




G2-M-G1 


456 


Bstc3 -24-340 


unknown 


G1/S-S; M-G1 


457 Bstc3 -24-90 




M-G1 


458 


Bstc3 -31-130 


unknown IG1/S-S-G2 


459 


Bstc3 3-120 


i ink nnu/n 


O 1 / o-o 


460 


Bstc3 -31-200 






461 Bstc3 -31-260 


UI IM IUWI 1 




462 


Bstc3 3-1 50 — 


uiiNnown 




463 


Bstc3 -32-105 


urmnown 




464 


Bstc3 -32-120 




o ]/o-o, oZ/M-IVl-ol 


465 I 


festc3 -32-240 


UI ll\i IUW| I 




466 I 


Bstc3 -32-320 




o l/o-o-o^, (Vi-ol 


467 I 


Bstc3 3-280 


UI IM IUWI I 


VJxi-IVl-O 1 


468 I 


Bstc3 3-2-90 


UI IM IUWI 1 




469 I 


3stc3 3-3-105 


unknown 


G2/M-M-G1 


470 f 


3stc3 3-3-115 




G1/S-S; M-G1 


471 I 


Bstc3 3-3-165 




G1/S-S-G2/S 


472 I 


&stc3 -34-110 




G2/M-M 


473 Bstc3 3-4-165 




G2/M-M 


474 Bstc3 3-4-200 




S 


475 &s\c3 -34-290 


unknown 


G2/M-M-G1 


476 I 


3stc3 -34-85 


unknown 


G2-M-G1 


477 I 


3stc3 -34-90 


unknown 


G1/S-S 


478 i 


Bstc3 3-90 


unknown 


S 


479 Bstc3 4-115 




G2-M-G1 


480 


Bstc3 -41-180 




G2/M-M-G1 


481 \ 


3stc3 4-13-300 


unknown 


G/S-S;M-G1 


482 Bstc3 4-3-100 




M-G1 


483 Bstc3 4-3-135 




S-G2-G2/M 


484 $stc3 4-3-190 




S-G2-M-G1 


485 t 


Sstc3 -43-210 


unknown 


G1/S-S; M-G1 


486 Bstc3 4-3-210 


jnknown 


G2/S-G2-G2-G2/M 


487 £stc3 -43-240 




G1/S-S; G2/M-M-G1 


488 t 


Jstc3 4-3-248 


unknown 


S 


489 i 


*stc3 4-3-263 


jnknown 


G2/M-M-G1 


490 1 


istc3 -43-280 


jnknown |G2/M-M-G1 


491 E 


*stc3 4-3-95 


jnknown 


3 


492 J 


Sstc3 -44-155 


jnknown < 


31/S-S; M-G1 


493 E 


*stc3 -44-173 




32/M-M-G1 


494 I 


*stc3 4-80 


jnknown ! 


5-G2/S 


49b t 


!stc4 -11-117 


{ 


32/M-M-G1 


496 E 


$stc4 1-1-125 


jnknown I 


\A-G1 


497 0stc4 1-1-130 


jnknown < 


32-M-G1 


498 Bstc4 -11-180 


( 


32/M-M-G1 


499 ^stc4 1-1-195 


jnknown ( 


31/S-S-G2 


500 Bstc4 1-1-197 


jnknown < 


32/M-M-G1 


501 §s\c4 -11-210 


jnknown < 


31/S-S-G2/S 


502 Bstc4 1-1-210 


jnknown ( 


31/S-S-G1/S 


503 Bstc4 1-1-245 (unknown f 


\/l-G1 


504 Bstc4 -11-350 Unknown ( 


32/M-M 


505 Bstc4 1-1-90 (unknown C 


32/M-M-G1 


506 Bstc4 -12-150 (unknown ( 


32-M-G1 



507 


festc4 1-2-280 




PCT/EP03/03703 
S-G2-M 


508 


Pstc4 -13-112 


unknown 


S-G2 


509 Estc4 1-3-170 


unknown 


G1 /S-S 


510 festc4 1-3-205 


unknown 


G2/M-M-G1 


511 


$stc4 -13-280 


unknown 


G1/S-S-G2/S 


512 6stc4 -13-70 


unknown 


G2/M-M-G1 


513 Pstc4 1-4-105 




M-G1 


514 


Bstc4 1-4-112 


unknown 


G2/M-M 


515 


Bstc4 -14-120 


unknown 


G1/S-S; M-G1 


516 


Pstc4 1-4-127 


unknown 


S-G2-M 


517 Bstc4 1-4-145 


unknown 


G2/M-M-G1 


518 


Pstc4 -14-160 


unknown 


G2/M-M-G1 


519 6stc4 1-4-165 


unknown 


G2-M-G1 


520 


kstc4 1-4-185 




G1/S-S-G2 


521 


Bstc4 1-4-270 




G1/S-S; G2/M-M-G1 


522 


Bstc4 2-1-150 


unknown 


G2/M-M-G1 


523 


Bstc4 -21-155 




G1/S-S-G2 


524 


Bstc4 -21-200 


unknown 


S; G2/M-M-G1 


525 


Bstc4 2-135 


unknown 


G2/M-M-G1 


526 


Bstc4 -22-150 


unknown 


G1/S-S; G1 


527 


Pstc 42-2-1 70 




S-G2-M 


528 Pstc4 2-2-185 




M-G1 


529 I 


Bstc4 2-2-220 


unknown 


M-G1 


530 I 


Bstc4 2-3-100 


unknown 


M-G1 


531 I 


Bstc4 -23-115 


unknown 


M-G1 


532 


3stc4 2-3-133 




S-G2/S 


533 I 


3stc4 -23-135 


unknown 


G2/M-M-G1 


534 I 


Sstc4 2-4-110 


unknown 


G1/S-S; G2/M-M-G1 


535 I 


3stc4 -24-240 




G1/S-S-G2 


536 I 


3stc4 -31-260 




G2/M-M-G1 


537 [ 


Bstc4 -31-310 


unknown 


S; G2/M-M-G1 


538 ( 


3stc4 3-3-100 




S-G2-M 


539 8 


3stc4 3-3-103 


- . ,_ 

unknown 


G2/M-M-G1 


540 I 


Bstc4 3-3-135 




M-G1 


541 


3stc4 3-3-175 




G2/M-M-G1 


542 I 


3stc4 3-3-250 


unknown 


M-G1 


543 £ 


3stc4 -34-135 


unknown 


G2/M-M-G1 


544 I 


3stc4 -34-185 




G1/S-S 


545 Pstc4 3-4-200 


unknown 


G2/M-M-G1 


546 


Bstc4 3-4-320 




G1/S-S 


547 


3stc4 -41-100 


unknown 


G2-M 


548 $stc4 -41-105 


unknown 


G1 /S-S; G2/M-M-G1 


549 6 


3stc4 -41-107 


unknown 


G2/M-M-G1 


550 f 


Sstc4 -41-125 


unknown 


M-G1 


551 i 


Jstc 4-41-180 




G2/M-M-G1 


552 I 


5stc4 -41-220 


unknown 


M-G1 


553 I 


3stc4 4-150 


unknown 


G2-M-G1 


554 ( 


IsXcA -42-110 


unknown 


G2/M-M-G1 


555 I 


istc4 -42-115 ! 


unknown 


G2/M-M 


556 E 


Jstc4 -42-130 i 


jnknown 


S-G2 


557 6 


istc4 -42-165 i 


unknown 


G1/S-S; M-G1 


558 E 


istc4 -42-217 i 


jnknown 


G2/M-M-G1 


559 i; 


5stc4 -43-103 i 


jnknown 


G1/S-S-G2-G2/M 


560 I: 


$Stc4 4-3-167 i 


jnknown 


32/M-M-G1 


561 E 


Jstc4 4-3-170 


I 


M-G1 


562 E 


istc4 4-4-120 i 


jnknown 


V1-G1 


563 E 


$stc4 4-4-290 i 


jnknown I 


32/M-M-G1 


564 E 


isttl -11-190 


( 


31/S-S 


565 E 


Jsttl -11-200 i 


jnknown < 


31/S-S-G2-G2/M 


566 E 


!stt1 -11-55 i 


jnknown I 


31/S-S 


567 | 


isttl -11-65 i 


jnknown I 


31/S-S-G2 


568 ^sttl -12-105 i 


jnknown ( 


32/M-M 


569 Bsttl -12-115 


< 


31/S-S 


570 E 


isttl -12-230 | 




3-G2 


571 E 


•stt1 -13-150 unknown ( 


32/M-M 



WO 03/085115 



Of d. 


pSttl -13-230 


unknown 


G2/S-G2-M 


O/O 


Bsttl -14-125 


unknown 


G1/S-S 


0 /4 


IpSttl -14-220 


unknown 


G2/M-M 


C 7C 

575 


tT^ -.11 «4 A *i j AA 

Bsttl -21-100 


unknown 


G2/M-M 


576 


Bsttl 2 -1-240 


unknown 


S-G2-M 


577 


fT"V . * 1 a A. A »A *■* A 

Bsttl -21-250 


unknown 


S; G2/M-M-G1 


578 


A * j A f\ 4f*% -At #*x 

Bsttl 2 -2-100 


unknown 


G2/S-G2-M-G1 


579 


[r"\ _ it A A A ^4 A A 

Bsttl 2 -2-140 


unknown 


G2/M-M-G1 


r on 


Bsttl -22-160 




G2/M-G1 




Bsttl 2 -2-215 


unknown 


G2/M-M 


coo 
oo*£ 


fc> j->4± «1 OO OIP 

Bsttl -22-225 




M-G1-G1/S 


coo 

583 


Bsttl 2 -2-360 


unknown 


G2/M-M-G1 


C O >1 

bo4 


3 Stti -22-70 


unknown 


G1/S-S 


con 

585 


I3stt12 -3-1 15 


unknown 


G1/S-S-G2 


c 0 c* 

586 


' -k ±l A A A *4 ^ a 

Bsttl -23-150 


unknown 


G2-M-G1 


587 


Bsttl -23-170 


unknown 


G2-M 


588 


!3stt12 -3-170 


unknown 


G1/S-S 


I™* t~\ 

589 


(3stt1 -23-180 


unknown 


G2/S-G2-M 


590 


Bsttl -23-185 


_ IG2-M-G1 


591 


3stt1 -23-235 


unknown 


G2-M 


592 I 


3stt1 -24-105 


unknown 


G2/S-G2-M-G1 


593 1 


3stt1 -24-120 


unknown 


G2/M-M-G1 


594 1 


3stt1 2 -4-260 




G2/S-G2-G2/M 


f— A f" I 

595 ! 


Bsttl 2 -4-320 




G2/M-M 


596 1 


-\ J 1 i A -4 .4 AA 

Bsttl -31-120 




G2/M-M-G1 


597 i 


Bsttl -31-180 


unknown 


G2/M-M-G1 


598 1 


3stt13-170 


unknown 


G1/S-S-G2 


599 [ 


Bsttl 3 -2-1 50 




G1/S-S-G2 


600 t 


Bsttl -32-170 


unknown 


GVS-S-G2 


601 i 


3stt1 -32-185 




G1/S-S 


602 E 


Bsttl 3 -3-1 00 


unknown 


G1/S-S-G2-M 


603 I 


&stt1 -33-1 70 


unknown 


G1/S-S-G2 


604 I 


3stt13-3-320 


unknown 


G2/M-M-G1 

1 VI IVI 1 


605 I 


3stt1 -33-66 




G2/M-M 


606 I 


3stt1 -41-120 


unknown 


G2/M-M 


607 l 


3stt1 -42-264 


unknown 


G2-M-G1 


608 I 


Bsttl 4 -2-280 


unknown 


G2/M-M-G1 


609 I 


Bstt14 -3-120 




S-G2 


610 I 


Bsttl 4 -3-140 


unknown 


G1-S-S-G2 


611 I 


Bsttl -43-220 


unknown 


G2/S-G2-G2/M 


612 I 


Bsttl -43-330 


unknown 


G2/M-M-G1 


p13 f 


3stt14 -3-460 


unknown 


G2/M-M 


614 I 


Bsttl 4 -4-130 


unknown 


S-G2 


615 I 


Bsttl 4 -4-150 


unknown 


G2 


616 I 


Sstt14-4-195 




S-G2-M 


617 I 


Jstt14-4-220 




32/S-G2-G2/M 


618 i 


is«14 -85 i 


lohits 


32/M-M 


619 I: 


istt21 -1-170 


jnknown 


32/M-M 


DO A 1 ■ 

bZO I: 


^stt2 -11-290 


( 


32/S-G2-G2/M 


A A -4 w 

621 I: 


istt2 -11-540 


( 


31/S-S 


622 [: 


Jstt21 -2-190 


( 


32/M-M-G1 


AAA V - 

623 I: 


Jstt2 -13-165 


\ 


3-G2-M 


624 I: 


Jstt2 -13-170 1 


jnknown ( 


32/M-M 


625 [5 


»stt2 -14-130 1 


jnknown 


32/M-M 


A A A ala 

626 t 


!stt2 -14-175 1 


jnknown I 


5-G2 


627 F 


)s«22 -1-140 1 


jnknown I 


3-G2 


628 f 


Jstt2 -21-300 1 


jnknown ( 


32/M-M 


629 £ 


istt22-2-110 1 


jnknown ( 


31/S-G2 


630 P 


lstt22 -2-255 


( 


31/S-S-G2-G2/M 


631 f 


!stt22 -2-370 


( 


31/S-G2 


632 i 


istt22 -3-100 1 


jnknown ( 


32/M-M-G1 


633 $stt22 -3-145 l 


jnknown ( 


32/M-M-G1 


634J 


»stt2 -23-220 1 


jnknown ( 


32-M-G1 


635J 


stt2 -23-370 


( 


31/S-G2 


636 p>tt22 -4-145 i 


jnknown C 


32/M-M 



PCT/EP03/03703 



637 


festt22 -4-170 




S-G2 


638 


IBstt22 -4-175 




G2-M 


639 


Bstt22 -80 


unknown 


G2/M-M 


640 


Bstt23 -1-128 


unknown 


S-G2 


641 


Bstt23 -1-155 


unknown 


S-G2-G2/M 


642 


Bstt2 -31-200 


unknown 


G2/S-G2 


643 


Bstt23-170 


unknown 


G2/M-M-G1 


644 


tstt2 -32-175 


unknown 


G2/S-G2-G2/M 


645 


Bs U23-220 




G1/S-S-G2 


646 


Bstt23 -3-200 




G1/S-S-G2/S 


647 


Bstt23 -3-265 ' 




S-G2-G2/M 


648 


Bstt23 -3-330 




G1/S-S 


649 


Bstt2 -34-170 


unknown 


G2/M-M-G1 


650 


Bstt23 -4-180 




S-G2-M 


651 


(Bstt23 -4-210 




G2/M-M-G1 


652 


I3stt2 -41-170 


unknown 


G1/S-S-G2 


653 


[Bstt24 -1-170 


unknown 


S-G2 


654 


I3stt2 -41-390 




S-G2 


655 


IBstt2 -42-300 




G2/M-M-G1 


656 


I3stt24 -2-318 




S-G2 


657 


Bstt24 -2-320 


unknown 


G2/M-M-G1 


658 


Bstt24 -290 


unknown 


G2/M-M 


659 


Bstt2 -43-150 




S-G2 


660 


Bstt2 -43-160 




S-G2/S 


661 I 


Bstt2 -43-50 




S 


662 I 


3stt2 -43-65 


unknown 


S-G2 


663 I 


3stt2 -44-230 




G2/S-G2-M 


664 f 


3stt2 -44-240 


unknown 


G1/S-S-G2 


665 I 


3stt24.-4-240 


unknown 


G1/S-S-G2/S 


666 I 


Bstt24 -4-260 


unknown 


G1/S-S 


667 E 


Bstt24 -4-283 


unknown 


G1/S-S-G2 


668 I 


3stt24 -4-285 


unknown 


G2/M-M-G1 


669 pstt31 -1-145 




S-G2-M 


670 I 


3stt31 -1-210 




G2/M-M-G1 


671 b 


3stt31 -2-165 


unknown 


G2/S-G2 


672 E 


3stt31 -2-185 




G2/M-M-G1 


673 i 


3s«3 -12-200 


unknown 


G2/M-M-G1 


674 4 


3st13 -12-315 




S-G2-M 


675 pstt31 -2-330 




G2/M-M-G1 


676 B 


3stt3 -13-110 


unknown 


S-G2-G2/M 


677 i 


Bstt31 -3-180 




S-G2-G2/M 


678 e 


Jstt3 -13-360 




G2/M-M 


679 t 


Bstt3 -14-130 


unknown 


G2/M-M 


680 Bstt3 -14-135 


unknown 


G2/M-M 


681 E 


3stt31 -50 j 


unknown 


G1/S-S-G2-G2/M 


682 i 


5stt32 -1-105 




S-G2 


683 pstt3 -21-165 




G2/S-G2 


684 pstt3 -21-305 i 


jnknown 


G2/M-M 


685 E 


Jstt32-140 i 


jnknown 


S-G2/S 


686 E 


istt3 -22-100 


i 


32/M-M-G1 


687 E 


>stt32 -2-210 




S-G2-M 


688 E 


Jstt3 -22-280 i 


jnknown < 


31/S-S;M-G1 


689 Bstt32 -2-510 i 


jnknown J 


S-G2-G2/M 


690 6stt32 -3-115 


i 


32/S-G2 


691 E 


Istt32 -3-155 1 


jnknown ! 


5-G2 


692 E 


;stt32 -3-160 


I 


VI 


693 c 


Istt32 -3-180 t 


jnknown ( 


31/S-S-G2 


694 E 


»stt3 -23-205 1 


jnknown i 


3-G2-M 


695 E 


:stt3 -23-65 1 


jnknown ( 


32/M-M-G1 


696 E 


istt32 -4-170 1 


jnknown i 


5; M 


697 6stt32 -4-195 


( 


31/S-S;G2/M-M-G1 


698 0stt32 -4-260 1 


jnknown ( 


31/S-S 


699 B 


stt3 -24-390 




tf-G1 


700 b 


stt33 -1-105 


( 


31/S-S-G2 


701 E 


stt33 -1-128 




3-G2 
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702 festt33 -1-1 32 lunknown (G2/M-M 


703 Bstt33 -1-160 i 


jnknown ( 


32/M-M-G1 


704 t 


lstt33 -1-185 


I 


V1-G1 


705 fes«33 -140 i 


jnknown < 


32/M-M-G1 


706 E 


istt33 -2-75 i 


jnknown < 


31/S-S-G2 


707 $stt33 -2-85 


( 


31/S-S; G2/M-G1 


708 festt33 -3-110 


i 


31/S-S; G2/M-M-G1 


709 Sstt33 -3-125 i 


jnknown ' 


32/M-M-G1 


710 Bstt3 -33-170 i 


jnknown ! 


5-G2/S 


711 f 


Jstt33 -4-110 




S-G2 


712 E 


Jstt33 -4-120 i 


jnknown < 


31/S-S-G2 


713 1 


5stt33 -4-130 i 


jnknown 


G2/M-M 


714 | 


Ssti33 -95 i 


jnknown 


G2/M-M 


715 E 


Sstt34 -1-110 




S-G2-G2/M 


716 E 


Sstt34 -1-170 




G1/S-S-G2-G2/M 


717 e 


3stt3 -42-350 


unknown 


G2/M-M-G1 


718 I 


3stt3 -43-145 


unknown 


G2/M-M-G1 


719 E 


3stt3 -43-190 


unknown 


G1/S-S; M-G1 


720 


3stt3 -43-265 




G2/S-G2-M-G1 


721 II 


3stt3 -43-280 


unknown 


G2/M-M-G1 


722 I 


3stt34 -70 


unknown 


S 


723 g 


3stt41 -3-1 00b 


unknown 


G2/M-M 


724 I 


3stt41 -3-130 


unknown 


G2/M-M-G1 


725! 


3stt41 -3-140 


unknown 


G2/M-M-G1 


726 f 


3stt41 -3-180 




G2-M 


727 


3stt41 -3-230 


unknown 


S-G2 


728 t 


3stt41 -3-90 


unknown 


G2/M-M-G1 


729 i 


3stt41 -4-210 


unknown 


S-G2-M-G1 


730 £ 


3stt4 -14-500 




G2/M-M-G1 


731 I 


Bstt41 -70 


unknown 


G1/S-S 


732 \ 


3stt42 -1-130 


unknown 


G2/M-M-G1 


733 t 


3stt42 -1-290 


unknown 


G2/M-M 


734 t 


3stt4 -21-60 


Unknown 


S-G2 


735 I 


BsU4 -22-100 




M-G1 


736 I 


3stt4 -22-360 




S-G2 


737 I 


Bstt42 -3-1 05 


unknown 


G1/S-S-G2/S 


738 ! 


^stt42 -3-1 1 0 


unknown 


G2/M-M-G1 


739 i 


pstt4 -23-130 




S-G2/M 


740 I 


kstt4 -23-160 




G2/S-G2-M 


741 ! 


3stt42 -4-150 


unknown 


G1/S-S-G2 


742 ! 


3stt4 -24-270 


unknown 


G2/M-M-G1 


743 I 


3stt42 -4-390 


unknown 


M-G1 


744 I 


!3stt43 -1-290 


unknown 


G2/M-M-G1 


745 


!3stt43-1-85 




G1/S-S-G2/S 


746 


I3stt4 -32-230 


unknown 


G1/S-S-G2/S 


747 


I3stt43 -2-238 




G2/M 


748 


[3stl43 -3-145 


unknown 


G1/S-S-G2 


749 


Bstt43 -3-210 




G2/M-M-G1 


750 


Bstt43 -4-230 


unknown 


G2/M-M-G1 


751 


Bstt4 -34-75 


unknown 


G2/S-G2-M 


752 festt44 -1-125 


unknown 


S-G2-G2/M 


753 


festt44 -185 


unknown 


M-G1 


754 


Bstt44 -2-135 




G2/M-M-G1 


755 Bstt4 -42-150 


unknown 


M 


756 festt4 -42-390 


unknown 


M-G1 


757 festt44 -3-240 


unknown 


G2/M-M-G1 


758 festt44 -3-250 


unknown 


S-G2-G2/M 


759 6stt4 -44-148 




G2/M-M-G1 


760 U Bc02-1 00 


unknown 


G2/M-M 


761 U Bc02-120 


unknown 


G2/M-M 


762 
763 


M Bc03-110 


unknown 


G2/M-M 


M Bc03-85 




G2/M-M 


764 fJ\ BcH-135 


unknown 


G2-M 


765 
766 


W BC12-150 




S-G2-M 


tfi Bc31-185 


unknown 


G2/M-M 
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767 VI Bc32-107 


unKnown 


o*c/lvJ-|vl-vj 1 


768 M BC32-1 10 


unknown 


O^/IVi-lVl-O 1 


769 iyi Bc41-1 10 


unKnown 


0 i/o-o, OZ/IVl-IVl 


770 M BC42-28U 


unKnown 


O^-lVI 


771 M BC43-95 


unKnown 




^•Tn ft A f~) A A . A O r\ 

772 lyl Bc44-130 






773 M BC44-95 


unKnown 


O^/IVI-IVI 


774 M Bt12-o0 


unKnown 




775 iyi Bt12-95 




IV) 


776 If/l Bt13-105 


unKnown 


|V|-VJ3 1 


777 {/I Bt14-100 


unKnown 


OZ/jVi-IVl-O 1 


778 M B114-85 


unknown 




779 Bt14-90 


unknown 




780 iyi Bt31-95 






1 _AWm At mm ¥ m t. M A***. A^K. _A\ At ^™ 

781 Bt33-115 






700 A/t D4QO "1 OO 




VJC. Ill 


783 M Bt42-1 35 


unknown 


G2-M 


784 M Bt43-95 


unknown 


G2-G2/M 


785 B144-145 


unknown 


G1/S-S-G2-M 
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CLAIMS 

1. A method for identifying and validating plant genes/proteins as targets for agrochemicals, 
said method comprising the steps of: 

a. Determining gene or protein expression profiles during a biological process of a plant or 
plant cell, said biological process being necessary for the growth and/or development 
and/or viability of the plant or plant cell; 

b. Selecting genes or proteins having altered expression during said biological process, 

c. Cloning said selected gene or the nucleic acid encoding said protein in its full-length or 
partial form, 

d. Incorporating said nucleic acid in a vector designed for downregulation of expression of 
said nucleic acid or the sequence homologous to said nucleic acid in a plant or plant 
cell. 

2. The method according to claim 1 , wherein said biological process cell division. 

3. The method according to claim 1 or 2, wherein said gene or protein expression profiling is 

based on nucleic acid or protein samples collected from a synchronized culture of dividing 
plant cells. 

4. The method according to claim 3,wherein said dividing plant cells are tobacco BY2 cells. 

5. The method according to any of claims 1 to 4, wherein the expression profiles are 
determined by means of micro-array, macro array or c-DNA-AFLP. 

6. The method according to any of claims 1 to 5, wherein said downregulation involves a 
viral-induced gene silencing mechanism. 

7. The method according to any of claim 1 to 6, wherein said downregulation involves the use 
of infectious DNA of virus is Tobacco Rattle Virus and wherein said plant is tobacco. 

8. A method for screening candidate agrochemical compounds comprising the use of any of 
the methods according to claim 1 to 10. 

9. A method for screening candidate agrochemical compounds comprising the use of any 
one or more of SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative 
thereof or one or more of the proteins corresponding to SEQ ID NO 1 to 785 or a 
homologue, functional fragment or derivative thereof . 
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10. A method for the production of an agrochemical resistant plant, comprising the use of any 
one or more of SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative 
thereof or one or more of the proteins encoded by SEQ ID NO 1 to 785 or a homologue , 

5 functional fragment or derivative thereof. 

11. An isolated nucleic acid identifiable by any of the methods according to claims 1 to 10. 

12. An isolated nucleic acid, comprising at least part of a nucleic acid sequence chosen from 
1 0 the group of SEQ ID NO 1 to 785 a homologue, functional fragment or derivative thereof. 

13. -Use of a gene nucleic- acid according to claim 11 or .12 or the protein encoded by said 
isolated nucleic acid as a target for an agrochemical compound. 

15 14. Use of a nucleic acid or protein according to claim 13, wherein the agrochemical 
compound is a herbicide. 

15. A plant tolerant to an agrochemical, in which the expression level of one or more of the 
nucleic acids corresponding the SEQ ID NO 1 to 785 or the homologue, functional 

20 fragment or derivative thereof, is modulated. 

16. A harvestable part of a plant according to claim 15. 
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FIGURE 2 
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FIGURE 2 (continued) 
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Exp. Ill - 37 days after inoculation 
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Sequence Listing 

GROUP 1 



SEQIDNOl 

GAATTCACTAGTGATTGATGAGTCCTGAGTAAGGTGAGACGAGAAGCGACCTTCTGACCACA 

AGACTTGTCAGCCTGAGACAGGTATGATATCCATATACTGCGTATCTCATAAGTGACTCGTG 

GATCGGATAAATGCTCAACCCATTTGCTAACATATCTGTCTTGCCTGTCAGGTTCCCAGGAT 
CACTACGCAGTCAATCGAATTCCCGCGGCCTATAGTGAGTCGTATTAA 

SEQIDN02 

TGACTGCGTAGTGATCTCAAAGAAGTGGAAGTTACGAGTGCTCGAGATGTGATGCAGCAGCT 

TCTTCAGGGTGCTGCAAACAGAAAGGTAGCTGCAACCAACATGAATCGTGCTAGTAGCCGTT 

CACACAGTGTTTTTACATGTGTGATAGAGAGCAAATGGGAATCTCAAGGAGTAACTCACCAC 
CGGTTTGCTCGTCTTACTCAGGACTCATCA 

SEQIDN03 

GNNATGCCCGATNTAAGCCGCCCCTANATACANTTNAAATGGTCCCGGANACCCTGGGNGAC 
AATNATNGACTGNGCAGTGGTTGAAGNTTGACAATTCCTATT 

SEQIDN04 

CNCNATTNTNAATAAGCCCGAAAAAGAAGAAGTAANGGAGGGAGAAGGCCTGAAGAGAATTT 
GCNGGATTTTCTAGAGTTGATACAGAAGTTCTTCCAAACTGTTCCACCTTGGATTATGACAA 

GTAGTGTCACCAACCAAGGTCGAGATGAGATACTGTTGCACATGTCTCAGCTGCGAAACTAT 
TGGCTCAAGCATTGAGTTGGCATCATATGA 

SEQIDN05 

TTTTANGNCANCAAATCTCNCTCTAACGGACCCTNGCATGGCTTGTTCAAAATAAATGCCTC 

AGGACAATACCACGTTATGTAATGGGGAGTGAACTGCGTATATCCGTTCTGCTCNTTTATCT 

GGGCGGNGCCTTTGAAGTTTTTGACAAACTCTNTCTGGNTCTCACACTTAGGGCCACACTCA 

TCATTACTGTTTGTCCAAAACTCGTACTCAACTCTTTCATCGGGATGTGGAAGCGCCTCTCT 
CCAATCAAGGTTTATG 

SEQIDN06 

CTTGGATGGTCNACCAGATTGAAGAACNCGAGAAAAAGCTGTTTTCTCATCCACTTCATAAG 
TCACAAAATGAACANCAGCCNTTGAGAATCNCAGCTGTGNTATGTANNTTCGAAGACATTGG 

CTGAGGATGCTGCATGGAAGTTTGTGAAAGAGAAAGCCTATCGATATGGTTACGATAAACCC 
AGCAATGGTTATTGGCGGTTTGTTACAACCAATAC 

SEQIDN07 

CCCAAGATGAACAGTCAGTCAAGTCGATCTCATGCCATATTTACAATTACATTGGAACAAAA 

GAGAATGGCTAATTGCTCGACGAACGATGATGGTGATGACATATTATGTGCCAAGCTTCATT 

TGGTTGACCTTGCTGGTTCAGAGCGAGCAAAGCGAACTGGAGCTGATGAGATGCGTTTACGA 

NAGGGNATTCATATCAACAGGGGATTGCTTGCTCTTGGCAATGTAATAAGTGCCCTTGGTGA 

TGAAAAGAAGCGGAAAGAAGGNGCACACATCCCATACAGAGATAGCANGTTGACACGTNTCT 

TACAGGACTCACTTGGAGGAAACAGCANGACAGTTATGATTGCTTGTGTCAGTCCTGCTGAC 
ACCAATGCAGAGGAGACCC 
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SEQIDN08 

GCGGTTGATATGTGGTCTGTGGGATGTATTTTTGCCGAGATGGTTCGAAGGCAAGCCTTATT 
TCCTGGTGACTCTGAGTTTCAGCAACTGCTTCACATATTCAGGCTGTTAGGAACCCCAACTG 
AGAAGCAGTGGCCTGGAGTCAGTTCACTCCGCGACTGGCATGTTTATCCAAAATGGGAACCT 
CAGAACTTGGCCTCTGCTGTTCCAGCATTGGGTCCTGATGGCGTGGACCTCCTCACGAAAAT 
GCTCCAATATGATCCGGCAGATAGGATTTCAGCAAAAGCTGCACTTGATCATCCATACTTCG 
ATAGCTTGGACAAGTCTCAGTTTTGAGGTTGCTTCTACTTCTAAGATCAGCC 

SEQIDN09 

GCAGCNAGCNAAGNTNNGGTNGGGNACGCCAANNANNGNTGTGCCTTTGATGACGTCACCAG 
NTATCANTCTACATANAACGGAGGNCTTGCGANNGGCTTGGNTCATTCTACNGNTCTAGGAT 
TNTCAACTCTNNTCAATTCTTCNATAACTNACCTATTCTCCTGCAGCAATATGTGAGACGTA 
ACCTAGA^TATTATTTGCCTTTATAGATATTGACTTATTCTGCT'TGC'ATATTTTATCTGCAG 
CCGGTGGCATCTACCATCAATACATTGGCTGGTGCATTGTATAAGGTGTTTTGTGCTTCACC 
TGATCAAGCTAGGAAGGAGATGCGCGATGCATGCTTTGACTATTTGAGCCTTGGTGGAGTAT 

TCTCCACAGGACCTGTATCTTTGCTTTCTGGC 
SEQIDNO10 

GTCAGTGCTTGAGCTGAACCCGTTGCTTGGACTTGACAACTAGCATCTTCTCTTTGCATGCT 
GCCCTCATGTATTGCCAATGTAATTTCTCCTCTAGCAAACCATTATGTATTACAAACTATTA 
TTATGATTGTGAATAACTTGTGAAAAGTTCAATCAATCTGAAAGAAATAATCTCT 

SEQIDNOll 

CGTTGNTTGTTTCGGGAAATTGGAACAGCATTGGTGAAGGCACTTACGGNCAAGTGTACATG 
GNTAAAGAAATTAGAACAGGGGAAATTGTTGCNTTGAAGAAGATACGCATGGACAACGAANG 
AGAAGGGTTTCCAATANCTGCTATACGTGAAATCAAAATCTTGAAGAAGCTGCACCATGAAA 

ATGNGA 
SEQIDN012 

TAAAGGACCGNTTTTGTTTCGAGAAATTGGNNCAGATTGGTGAAGGCACTTACGGTCAAGTG 
TACATGGCTAAAGAAATTAGAACAGGGGAAATTGTTGCTTTGAAGAAGATACGCATGGACAA 
CGAAAGAGAAGGGTTTCCAATAACTGCTATACGTGAAATCAAAATCTTGAAGAAGCTGCACC 

ATGAAAATGTGA 
SEQIDN013 

GGACGTTTGCATTTCGGATTNGNGCACGAGATGTTNATGATTTTAGGATTTATTTTAGTCAT 
CTTACTCGGNTGATGTTTATTCGTTTTTGTGACTTTTACTCGNGGGCGGNGGTGACCGCGTA 

CATGCTATTTATTTGATTTTTACTATGGNTATTGNTTATTGTTA 
SEQIDN014 

TTACNTTTACTGAGATNNTTATGATTTTAGGATTTATTTTAGCCATCTTACTCNGGTGATGT 
TTATTCGATTNTGTGACTTTTACTCGNGGGCGGTGGNGACCGCGNACATGCTATTTATTTGA 

TTTTTACTATGGTTATTGTTTATTGTTA 
SEQIDN015 

TATCAAATGGAGAAGTTATCAATATGAAAATAGCTGTCAAGCCACTTCAACTATTGCTAGGA 
AGCAGCAAACTGTGACGCGAGATAAACATGACACAGAACTCATTGCTAGGGGTNGNNATGAT 
NCTTGTGTAGTTCCCNAANCTGTNCCANTGNTTTAAGCAATGGTAGCCCTGGCGCTAGTGGA 

TNAGCTAATGGCTCATTATGCACAGTGTATGCTGTTCCCAA 
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SEQIDN016 

TTGAAGAGTGGAAGTTACGAGTGCTCGAGATGTGATGCAGCAGCTTCTTCAGGGTGCTGCAA 
ACAGAAAGGTAGCTGCAACCAACATGAATCGTGCTAGTAGCCGTTCACACAGTGTTTTTACA 
TGTGTGATAGAGAGCAAATGGGAATCTCAAGGAGTAACTCACCACCGGTTTGCTCGTC 

SEQIDN017 

GGGCCCGCCACCACCGCAACCACCTAGTTATTCCTCCGTCGAACACGTGTCTCACGAGAGTG 

AGAGTGAGAGCGTTCATCGTCAGCATGATCATCATCGTTTTCAACCACATGTGCCTTCATTC 

TTCCACCATGAGACCTCACCACATCCAGAGCTCATCGATAAGCCTTCATTTAGGGTTTATAC 

AAAGGCTGATCCCAATTACTCTCTCACTATCCGTGACGGCAAAGTCGTTCTTGCCTCTTCTG 

ATCCATCCGATCCTTTTCAACACTGGTATAAAGATGAGAAGTACAGCACTAAAGTGAAGGAT 
GAAGAGGGGTTTCCAAGCTTTGCTCTGG 

SEQIDN018 

GTNTNTGTGGCCCACCTGCTGCNAGAGTGACACACAGGNATGATCTTAGAGCNGCNATTCAG 
AAGATGTTAGACACTCCTGNGCCATACTTGNTGGATGTGNTTGTACCTCATCAGGAACATGT 
TNTACCTATGATTCCCAGAGGCGGNGCTTTCAAAGATGTGATCACAGAGGGTGACGGGAGAA 
GNTCCTATTGANTTTGAGNNGCTACAGAGCTAGTTCTAGGCCTTGCATTATCTAAAATAAAC 



SEQIDN019 

GCAAGGAGTCAAGTGGATATTTTGGATGATGGTTATAGATGGAGGAAATACGGACAGAAGGC 
TGTCAAGAACAACAGATTCCCAAGAAGCTACTACCGATGCACGCATCAAGGATGTAACGTGA 
AGAAACAAGTACAAAGGCTGTCAAAGGATGAAGGAGTAGTAGTAACTACTTATGAAGGCATG 
CATTCACATCCCATTGAGAAGTCCACAGATAACTTTGAGCACATTTTGACTCAGATGCAAAT 
CTATGCTTCCTTTTGAAACGTCCATCACTTCAATGCCTAAGGCATGACACTCAATTAGTCAC 
TTGTAAAATAGTACTACAGTATATTGTGTACATGCGTTTTGAACCTAGATGCTATATTTTGA 
AATAAAACGCAACTTCATTAGGGAATTTAATTTGATCATTGTACAACTAAAAGTAATGTTGC 
TATTTTTTTGTTTTTATCACTTTGTTTTTGCCGGAGCCATGCTCTTCATTTTAACTCTTTTC 
TTTTAGAATTAACAAATAATTTCATGTTGGAGAAAGATACGTGCCAAAAAAAAAAAA 

SEQIDNO20 

TAATGGACACGGATCTGCACCAGATAATACGCTCTTCACAAGCACTGACAGAAGATCACTGC 
CAATACTTTCTCTATCAATTATTACGTGGACTCAAGTATGTACATTCAGCTAATGTCCTCCA 
CCGGGATCTGAAACCTAGCAACTTACTACTCAACGCAAACTGTGACCTCAAGATTTGTGATT 
TTGGGCTAGCTAGAACCACTTCAGAGGCGGATTTTATGACTGAGTATGTTGTCACCCGCTGG 

SEQIDN021 

TTTGCAGCCACCTTTNACATTTCGGTAGANGATNTGTCCATAACAAGCCTGACTTTTNTAAA 

GGAATTTACAGCTTCAATTGAAGCAAAACAAGTGGCTGCTCAAGAAGCTGAAAGAGCAAAGT 

TTGTTGTGGAAAAAGCTGAGCAAGATAAGCGAAGTGCTGTTATCAGAGCTCAGGGTGAGGCT 

AAGAGTGCCCAGCTTATTGGTCAAGCGATTGCCAATAATCCGGCATTTATCACACTCAGGAA 

AATCGAAGCAGCAAGAGAGATTGCCCAGACTATCTCACATGCAGCAAACAAGGTGTACTTGA 
GTGCCGATGATCTGTTGC 
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GROUP 2 



SEQIDN022 

ctgaaccctaacgcacacaacttcactctttgctcctccaaatctctctccaatgcaggatt 
tcatcggctccgttcgccgatctctggttttcaagcagtccggagacttcgataccggcgct 
gccggtgtcggcagcggattcggaggcttcgttgagaaactaggttcgagcattcgcaaatc 
gagtattggaatcttctcgaaagctcatgttcctgctcttccgtctatttctaaagctgagc 
tgcccgcgaaggctcggaaagatgacactccgccaatccggtggaggaaaggtgaaatgatt 
ggatgtggtgcttttggtagggtttatatggggatgaatgttgattctggagagttactcgc 
tataaaggaggtttcgattgcgatgaatggtgcttcgagagagcgagcacaagctcatgtta 
gagagcttgaggaagaagtgaatctattgaagaatctctcccatcccaacatagtgagatat 
ttgggaactgcaagagaggcaggatcattaaatatattgttggaatttgttcctggtggctc 
aatctcgtcacttttggqaaaatttggatccttccctgaatctgttataagaatgtacacca 
agcaattgttattagggttggaatacttgcataagaatgggattatgcacagagatattaag 
ggagcaaacatacttgttgacaataaaggttgcattaaacttgctgatttcggtgcatccaa 
gaaggttgttgaattggctactatgactggtgccaagtcaatgaagggtactccatactgga 
tggctcccgaagtcattctgcagactggccatagcttctctgctgacatatggagtgtcgga 
tgcactattatcgaaatggctacaggaaaacctccttggagccagcagtatcaggaggttgc 
tgctctcttccatatagggacaaccaaatcccatccccccatcccagagcatctttctgctg 
aatcaaaggacttcctattaaaatgtttgcagaaggaaccgcacctgaggcattctgcatca 
aatttgcttcagcatccatttgttacagcagaacatcaggaagctcgcccttttcttcgctc 
atcctttatgggaaaccccgaaaacatggcggcgcaaaggatggatgttaggacctcaatca 
ttcctgatatgagagcttcctgcaatggtttgaaagatgtttgtggtgttagcgctgtgagg 
tgctccactgtatatcccgagaattccttagggaaagagtcactctggaaactaggaaactc 
tgatgatgacatgtgccagatggataatgatgattttatgtttggtgcatctgtgaaatgca 
gttcagatttgcattctcctgctaattataagagttttaatcctatgtgtgaacctgataac 
gattggccatgcaaatttgatgaaagtcccgagttgacgaaaagtcaagcaaacctgcatta 
tgatcaagcaactattaagcccactaataaccccatcatgtcatacaaggaggatcttgctt 
tcacatttccaagtgggcaatctgcagccgaggatgatgatgaattgacagagtctaaaatt 
agggcattccttgatgaaaaggcaatggacttgaagaagctgcaaacaccactatatgaagg 
attctacaattccttgaatgtttccagcacaccgagtcccgttggcactgggaacaaggaaa 
tgttccaagtaacataaacttaccaccaaaaagcaggtcaccaaaacgtatgcttagcaga 
ggctctctactgccattgaaggtgcttgtgctcccagcccagtgactcattccaagcgaat 
_tcaaatattggtggcctaaatggtgaagctattcaggaagctcagttgccgaggcataatg 
aatggaaagatcttcttggttctcaacgtgaagcagttaattcaagcttctctgagaggcaa 
agaaggtggaaagaagagcttgatgaagagttgcaaaggaaacgagagattatgcgtcaggc 



a 
a 
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a 



attaatgatcctgcaaccccagaacagagagttagatgtcttgagcaggtatacgaacgtg 
^ggttttcttgacccgttactacaggaatatcagcgcttgtcagatagagtgagctgttact 
acaggaatatctgtcaacctgttaatcatattataaaatgccaataatttgcgttgtattcg 
ttttgatcattctcctgagagcattgtaagaaaaatgcaggcctttttataacctatataag 
tgctctctcatggtagttgccaatattaaaacgcagagaaaagtcgagttctcatctgctga 

attatttgtaaaatgtgatatattaatgtatttaccgtcttacaacc 
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SEQIDN023 

ccacgcgtccgtgatatgggatgtcacattgatggatttattgctgtagttggacatacaca 

tgttcttcacgaaggaccagttactggtagacctgctgacgtcattgcagctgctaatacag 

ccgctgaagttgctttgaggcttgtgagaccaggaaagaagaactcggatgtaacagaagct 

attcagaaagttgctgctgcctatgactgcaagattgttgagggtgtcttgagccatcaaat 

gaagcaatttgttattgatggaaacaaagttgtattgagtgtgaccaatcctgaaacgagag 

tagatgaagcagaattcgaggagaatgaggtttactccattgatattgtgacaagcactggt 

gaaggaaagccaaagttgttggatgagaaacaaacaactatctacaagagagccgtggacaa 

gagctataacctgaagatgaaagcatcaaggtttatcttcagtgaaatcagtcagaagttcc 

ctatcatgccatttaccgcaagggatttggaggagaagagggctcgtctgggcctagttgag 

tgtgttaaccatgagcttttgcagccatatcctgttctacatgagaaacctggtgatttggt 

tgctcacattaaattcacagtgctgttaatgcctaatgggtcggataggattacatctcatg 

ctctccaggagctgaagcctgcaaagtcgatagagaatgaacccgaaatcaaagcctggctt 

gcccttcccgttaagaccaagaagaaaggcggtgggaagaaaaagaaagggaaaaaaggtga 

caagacagaagactcatctcaagctgagccaacggaaggatagagaaatggtttcaaatctt 

gataaatagcaattttgaggtgcttgatcgatcaacttcactgaaactattggttcactgtt 

ggtcggcactttcagctgcctttgttcttccttgtggggctttgctatacaagggacagaca 

gttattgtcctcttgtactgtcatgttaaattactcagttttccaatgctattcaacatgct 
ctcaatcggtctttaaaaaaaaaaaaaaaag 

SEQIDN024 

ccacgcgtccgcaaaaccctagctcaaatcccgtttgcctccattttcattccatcaacaaa 

aacctaagtttatactcagcttgagacatttgataactatgtcggacgacgagagagaagag 

aaagagttggatctgacaagtcctgaggttgttactaagtacaaaaatgctgctgaaattgt 

taacaaggctctgcagttggtggtgtctgaatgcaagccaaaagcaaagatagttgatcttt 

gtgaaaaaggagatgccttcatcaaagagcaaactgggaatatgtacaagaatgtgaagaag 

aaaatcgagaggggtgtggcatttcccacctgcatttcagttaacaataccgtgtgccattt 

ttctccactgtctagtgacgagacagtattggaagaaggtgatatggtgaagattgatatgg 

ggtgtcatatagatggctttattgctgtagttggtcatacacatgtgctccaggaaggacca 

gttactggtagagcagctgacgttgttgcggctgctaatacagctgctgaagttgccctgag 

gcttgtgagaccaggaaggaagaactcggatgtaacagaagctattcagaaagttgctgcgg 

catatgactgcaagattgttgagggtgttttaagccatcaaatgaagcagtttgtgattgac 

ggaaacaaagttgtgttaagtgtgtccaatcctgaaacgagagtagatgatgcagaatttga 

ggagaatgaggtctattcaattgacattgtaacaagcactggtgaaggaaagccaaaattgt 

tggatgagaaacaaacaaccatctacaagagagctgtagataaaagctacaacctgaagatg 

aaagcgtcgaggtttattttcagtgaaatcagtcagaagtttcctgtcatgccatttacagc 

aagggatttggaggagaagagagctcgtttgggactcgttgaatgtgttaaccatgagcttt 

tgcagccctatcctgttctacatgagaaacctggtgatttggttgctcacataaaattcaca 

gtgctgttgatgcctaacgggtcagataggatcacaactcatactctccaggagctgaaacc 

tgctaagacaatagaggatgaacctgaaatcaagacctggttagcccttcccgtaaaaagca 

agaaaaaaggcggcgggaagaaaaagaaagcgaagaaaggtgagaagacagaagactcatcc 

caagctgaaccaatggaaggagaatcaaatggtgctgaatcttgatatgttgctagaacttt 

gatttgattcaattccaagaactatttgttgattgttagttaaatgtgggatattgaggtag 

ttgtggatctttctttgcggcattttgcaatacaagaatggcatggacagttgttgtccttg 

tcttgacacatttgtcatgctggaattattaagtggggtttccaatgctataatgtcatgtg 

tatcaaaaaaaaaaaaaaaagggcggcaactctagagtatca 



FIGURE 4 (continued) 
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SEQIDN025 

ccacgcgtccgcttggggattagcaggttgtcgacaaagaaaattcatttgtttcctacgat 
cacacaagtcgtggttgttgcagatccgctttcgctaaggggaaactcaaaagcccagttcg 
tgtagttcatccaaagatgagttccagcaaaagggttgggaagtcttctaattcatcaggaa 
agcagaaagctatatgcgaaacaactactacaccaacggttgat'gatataaatgtaggcgta 
gaagatatggggttgaactccgatcaaaatgatggatggatagtgtgttctagaaagtccaa 
gaacaagggtggaagcagcagtgctggaatgaagcaatggatttctcagaatcccactccaa 
aagccaaactgggaatgcgtaacaatattgttggatcatcaggacaggggtctaggaataac 
tggtccacacctaattatcatcctcgaaaacctgctggcagagaatgctacacaccgacacc 
cgctgcagttcctcctgccctgaagaatggttgggattggtcatctgtcgctcgttccaatg 
aggaccatgatacttattcccctgtcgctgatgtcaaggcttcctgtgaacatgatggagag 
gataatgaatcggatttgcctgatgatgacagtgatgatgagcttccgagtgatgacgactt 
tgatgatcactcggatgtaaatgaaatgagtcatgaggtactcaaggaaagtcgttggttca 
agaacc ttt tcaaatgtcttgacagtttgactgtcact gaga ttaat gat ccggaaagacag 
tggcactgccctgcatgcaaaggtggtccgggtgcaattgagtggtttccagggatacagtc 
agtgatgaaccacgcaaaaacgaaaggatttaggatgaaattacacagacaacttgctcaac 
ttttggaggaagagctgcgtcggaggggaacttctgttgtacctccaggtcaagtgtatgga 
agatggggtggcggtgaatatgaagataaggaaatagtgtggccaccgaccgcgattatcgt 
gaacacagtgcttgagaaagatgaaaatgacaagtggattggaatgggaaatcaggagctgc 
gtgattatttcagctcttatgctgctgtcaaggcagcgcgaagctcatatggtccacaaggc 
catcgtggtatcagtgtgttgatttttgaggccactcccgtgggatacatggaggctgtact 
tctcagtgagcagttttctgaaaaaggaagtgatagagatgcatgggaacacaatccagttc 
tcttttatcctgggggaaaacgtaagctttatggttacatggcagagaaaagagacatggac 
aactttaaccggcattcacatgggaaatcaaggctgaagttcgagatgaggtcatataaaga 
aactgtttcgaatccagcgatgcagatgtcggaggataatcaacagctcatatggttcaaga 
accaagcctctaagcaccaaaagcgggctaaagctactgaagagtctctaagactggtgagt 
gaaaagcaccgtcagacagtcgaagagaacaagattgtcagactgagaactaagatgcacca 
tgaacggaacaaggaagagatggaatatctagagcagttttttaatgatcagttgaaaatga 
tttatgatgccaggactgctgaggaggacaagtttgaaaagatacagcaggaacagcgtgag 
atgatctatcaatctaatgcaactatttcctcggctgaggatcatcgactcagggcagagaa 
agttgcgaaatttatcaaacttcaggacaaggatatggaagaatttgtggaagagagggata 
atctgataagagctcatgaagatagggtaggttcaatgagacgcaaatacttgctgcaatac 
tcggaagaggcagttgcacttgagaagaattttgatctcgaactggctaagctgatggagaa 
gtactcatcaaagcaatctgagcaggtcaacagcagtgatgccgtgtgaccctatagtaata 
ctattcaagcgccgttttagctttaaatttctgtgaacttgggattcttcactgacttttat 
aatcctggtctgtccatgtgttttgatgatgctaaagaaatgattctaatagttatattata 
tcctaaaacatatggcttgaactatttgttctagaaaaaaaaaaaaaaaaaaagggcggcc 
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SEQIDN026 



aacagaaccaagaacaagaaqaatc?J?^!! 9 ? 9 aaagaagaagaaaaa 9 ca ^ 
aa ggttgaaacLttaa?Jgg?L^at;^ g tgaaaatbtta 9tgggtctaatggc 

aactggtaaggatgctaatgagtcLagtacaaa^ 
ttccaa g tgat g tgttaga?tgt?gcaSLJt a ?aaaiL 

tc ^ggccttttcttttagatqaccata??t^ ^ g 9 aaaccatcaccaa ttcaatcacat 
gttctctgcgatgctgggaaJcctactgJtgS 

taagcatcatcaaaaagcttctctt^JK? ? ? 5 gtCtatatg 5 t 9gagtcga 
gtttgcaggatatgatggaaatgggaacaSS^ 

g a tgaagctgatcgga?gctcqa???aa^^ Cttaaaagc 9gtttcttttgtg g tgcta 

tagctcaagaattcatggatc?t g catcLatL 9 aggta^^ 
gccaaccatgatgtcatgcaaattatcaaa^?; 9 ? 9gttCagaagattt 99ct 

acagtgcttgctggaaaLtacLLaltt^agaL'oL 3 ^!^ 

acaagaaggaagcatctcQaattaa^s?^; f 9 9 gtattggttttt 9tttt g t 
attagcg g tgaLagcaa^aca ga ctca a ict a r a9 %^ 

ttgtgataaattatag??t a cc g Sai^^ 9 CtCgaggtCtggata tcccagat g tt g aag 

gggcgagctggtaaalaag^taacSta^ 

tggggagttga g taaatgtt g SSga 9 gag a ct^ 

ttggaacccatgtaaagaaaaaaa 3 ^^ 9 ? f 59 gtgCCagctgcccttctt aatt 
gcaaatgctcca 9 aaggc 9 ta C a:a 99 a a t a a\ C a g att 9 t1S 

aata tcattattaccaaagcaacacaact a cat ga aa a toS? a tgaggaagatt 9 a 9aagc 
gcaatcatttggcqaatacatoi-»« a a^ cca ^tg aattggctcatcatcctgacattccgt 

t g agtttgaaatact? g tgtgt? a tt gC tacc a taat^ 
^^tttttctaggatcLaatgLg^^ 

^gtatgtatgattgcaccttccaaattacJac^t^ attataattt 9 ccc ttttaagtg 

aa tgagattcaaaactggttcat a caaaaai^ a f tggCaaaaggt ^aaaaatattcgaa 

l. u^ct Lgcaaaaaaaaaaaaaaagggcggcc 



FIGURE 4 (continued) 



WO 03/085115 PCT/EP03/03703 

8/140 



a 
a 



a 
a 



SEQIDN027 

ccacgcgtccggtggaacgccacgttgccattctctcttcggtgacaagcttcaaacgccag 
gcgtctctctcgtatctgagtgctggtgttttttcttctcaggttcaagttccggcagttaa 
tgcggcaaattcaaatttagatgttatgcaatagctgcgtaagatttgtgttttttcaagca 
gcacatattqatacacacattgtgcaaaggcaatttatcaaactcattaaaagtttgaatgc 
aactcgtagcataagttattcttggaatagtgtatatacagcaaggagaacgatgatggtgg 
atactggagcaactgctaaaggaggacctgtcgttgatgtttcaccggagaaggatgataat 
aatggtggtttcgctagcggaggatggaagagtgaagatggaagactgagttgtggttattc 
aagctttagagggaaaagagccaccatggaggatttttatgacattaaaacttggcaaagtt 
gatggacaaacaggtagcttatttgggatatttgatggccatggtgactctcgcacagctga 
gtttctgaagaaacatctctttgagaatctaatgaaacatccagagttcccaacgaacgcca 
gctggccataagtgaaacatatcaacaaacagacatggacttcttagattctgaaaaagat 
_ccttccgagatgatggttccactgcttcaacagcagttctagttggtaaccatctctatgt 
tgccaatgttggagattcacggactataatatcgaagggcggaaaagcaattgctctttctg 
ggatcataagcccaatcgaactgatgagaggaagagaattgaaagtgccggaggtgttgtg 
-tgtgggctggtacctggagagttggtggtgtattagcaatgcacgtgcttttggcaaccgt 
atgttgaagcaatttgttgtggctgaacctgagattcaggatcaagagattgatgaggaatt 
agaactactcgtgcttgccagcgatgggctctgggatgttgtaccaaatgaggatgctattt 
cacttgcacaagcagaagaagaaccagaagcagctgctaggaagctaacagaaactgcattt 
actcggggtagtgctgacaatattacctgcatagtggtgaagtttcaccacaagaaggttga 
accagaggggagccagcaaggttgaagaatttgttgatgctgcatctgccttttcctggtgg 
aaggctgcttcaatgatgccggtgcaagttgctgacgatagcatcacaggggctgtcatttt 
ttcattcatttctttgcattgtttttccccgtcatcctgtttaactgttgtatttaaggtgt 
ctgcgtttgtgcgtctgctttctccttttctgtagaggtattgtctggataaactttactgt 

gaaacgtagttaaaaggttaaaaaaaaaaaaaaaag 
SEQIDN028 

ccacgcgtccggaagaaatggttgaattcatggaaaaggtcttcaactccctcggctcagaa 
gaactcaccgtggaggaacgaaacctcctctccgtcgcgtacaagaacgtgatcggagcgcg 
tagggcatcgtggcgtattatctcatcgattgagcaaaaggaagagtccagagggaacgagg 
aacacgtaaactctatccgcgagtacagatctaagattgagaatgaactctctaagatctgt 
gatggcattctgaaattgctcgatgcaaagcttatcccttctgcagcatctggtgattctaa 
ggtgttttacctgaaaatgaaaggagattaccaccgctatttggctgagttcaagaccggtg 
ctgaacataaggaggctgctgagagtactctcactgcctacaaagctgctcaggatattgca 
actactgagcttgccccaacacatcccatccgacttggactggctcttaacttctctgtgtt 
ttactatgagatcttgaactctcctgaccgcgcttgcaatcttgctaaacaggcctttgatg 
aagcaattgctgagttggacacactgggcgaggagtcttacaaggatagcactttgatcatg 
caacttcttcgtgacaatctcactctctggacatctgatatgcaggatgatggggctgatga 
aatcaaggaagatcccaaacctgatgaagccaaaaattgaaggaattgaaactctctaattt 
gcttttcacttcttcctggttgtttttattggaagaagctgattatcgtaatttcctttact 
attatggttttccgctagggggttgtcttattggaaatgaacaacttttaatattgatgttt 
cagaagttccatctttaatttaatgtggtttttctggtggtaaaaaaaaaaaaaaag 
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tct^^r tactgtttc ^ 

aoo^r ggaag H 9gatatCtaCaCatCtaaCtgca 9 aatt g ttac ^aaacactgat 
laal.Jt g l^ g& tattctCaaatattcaa ^ ta 9 c attttgaaaaatgtcagatgata 
ac?aaa£?f tttgCat C = aattcttcaaaccaat ca 9^ca g tctctgc?cacaaagatt 
gctaaacttgaggcaagaatggtgggcaaagcctcatctacatctacatcccgaqctacttc 
^ gg ^ gCC ^ Cag f caagtt ^ 

ctgtctctagtgattctgatgatgatgatgataatggaagggaatttctcatacaagcaaac 
actcagaagcggcgcaaactcgaggatgacaacagctcaacttcatttgaacatgtgqaaac 
L C a a ? s C l 9 ! aCtgtgaaaaagatagta g acaata "gacacaagcaaagtgggttcagatg 

caaat^^! aaaCaaagCCgt9tCaagggaCaaaCtaattctggtaga 
cgagttagtgatcagaccaagtcacaagcagtttctgtgtcaaatggtcagctcgagaactc 

ttaccagaaggatggtttgccaaaagagcaaattgggcacgatcgacagactgtattcgaaq 
aggagatcacttctttacgggcaaaagttgtggctttggaggaagagcttaagaaatcccgt 
caagaggcatcagattatcaacatcagtgtcaacagctggaaaaggaattgaaggatcttaa 
agattatgagcagcagacaaagccaaagagaacgaaaataatatctgaattgttaatatctq 
tttcaaaagctgagaggcaagaggcacgaatgaaagtgcgacaggaatctttgagactgggc 
Tztnltnr? 9 taatCagagCtggaaCCattatttct g a gg^tgggaagatgggcaagcact 
ataaJt?nn? aa ? CtCagCttagaaacttatta g aaa ^aaagaagctatt g aacggcagc 
gtaaattgctcaagaaacgacaaccagataaaagtgatggaggagatgtggagggaggtttq 

tgaggaagatgtgataatgcgtgagagggaccgatatgaactagagaaaggaaggctaattc 

gtgaaatgaaacgcatacgtgatgaagatggttctcattttaacaattttcagattttqaac 

"^ gata ^ gCCCtC " aaaccttctt ^ aaaa gg^ggatttagtgaggt g tacaaggcttt 

cgacttggtagaccatagatatgttgcatgtaagctacatggactaaacgctcagtggagtg 

aagagaagaagcaaagttatatacggcatgcaatcagggagtacaacatccacaagactttg 

gtgcaccatcacattgtgcggctttgggacatttttgagatagaccaaaacaccttttgcac 

tatcttggagtactgtagtggaaaggaccttgatgcagttctcaaagcaacacctgtgttqc 

cagaaagagaagcaagaatcatcattgtgcagatttttcaaggccttgtctacttgaataag 

oagtcacagaagatcatccattatgatttgaagccaggcaatgttttatttgatgagtttgg 

cattgctaaggtcactgattttggccttagcaagatagtggaggatgatgttggatcccagg 

ggatggagctaacatcccagggagctggaacgtactggtatctacctcctgaatgctttgaq 

ctaagcaagacacctcttatatcctcaaaggttgatgtctggtcagctggtattttgttgta 

taa^t 9 ?t t ^ ggCaaaCgtcccttt ggg c ^gaccagtcacaagaaagaatactaaggg 

aggacacaattattaaagcaagaaaggttgaattccctacacgaccagctgtctctaatqaq 

gcaaaggagttcattcgtcgttgtttaacatataatcaagcagataggccagatgttttaag 

tattgctcaagacccttacttgacatactcaaagaaatgataggaggatgttaatcccaact 

n^^r 0993 ? 9 ? f tt999aCgaggattggtgctCaaaggaatttt g tata gttgtaaag 

ccatgtaattttttgtccctgtaccttcgactagagtggggcggctcaaggggagctttgct 

ttaggccccaaaattttgggggcatttgcatctatacccagtttttgggttaacttttaact 

tatatccgcattgcaaaaaaaattgcaagcatacctacttttcgggtaacttcagacattcg 

ggtctgaagtagcaaaaatttatgtctgaagtttgaacttcagaatgttttgcctgaagtqt 

agtaaaacttcagatatttttgcctgaagtttggcctgacttgcaaagtcaatcacgcaaac 

:.° ag t ! Catagtgcaatggcaaacttca 9ctcaataaaattacagcatgttttggctgaag 

t.tttgtuttgtaattgttgaacttcagcattttaggaactgaagtttgttttgtaattgct 

tin*! ^ 9 ?! tt ^ aggggtgaa g ttt g tttt gtatttgctgaacttcagcattctaggag 

ttgaaatttttgtttatatttgctgaacttcagcattcttagagctgaagttctaagtctgc 
acacggaaatgaggaagataacc 
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ccacgcgtccgacgaaatccaaccgtcgaatctcaggcaacaggcggcagctcatttcaccg 

ctgtaacaaaaattcgagagaatggcaatggtagatgagccattgtaccccatagccgtgtt 

aatagatgaacttaagaacgatgatatacaattacggttgaattcaattaggaggttatcga 

ctattgcacgtgcccttggtgaggaaagaactcgaaaggaattgatcccttttttgagtgaa 

aacaatgatgatgatgatgaggtgttattggcaatggctgaagagcttggtgtgtttatccc 

ttatgttggaggtgtagagcatgctcatgttttgctcccgccgttggagacgctttgtactg 

ttgaggagacctgtgtgagggataaagctgttgaatcgttgtgtaggattggatctcagatg 

agggagagtgatttggttgattggttcgtccctcttgtgaagaggctggcagctggtgaatg 

gttcacagctagagtttctgcctgtggactctttcatattgcttactcaagtgccccagaga 

tgttgaaggcagaacttcggtctatttacagtcaattgtgtcaagacgacatgcctatggtg 

cgaagatcagctgccacaaacttggggaagtttgctgctactgttgaatctacttacctcaa 

gagtgacatcatgtcaatatttgatgatcttacacaggatgatcaggattctgtacgcttat 

tagctgttgagggctgtgctgcacttggcaagctgttggagccccaggattgtgttgcacac 

atcctgcctgtcattgtcaacttctctcaggacaagtcttggcgcgtccgctacatggttgc 

taaccagttgtacgaactatgtgaagctgtagggcctgagcccactaggacggatttggtgc 

ctgcctatgtccgtttgcttcgagataatgaagctgaagttcgcatagctgctgcagggaaa 

gtcaccaaattctgtcggattcttagtcccgagcttgctattcagcatattcttccctgtgt 

gaaggaattatcatcagactcttcacagcatgtcagatctgctttggcttctgttataatgg 

ggatggctcctgttttgggaaaggatgcaaccattgagcatcttcttccaatatttctttcc 

cttctgaaggacgagtttcctgatgtgcgcctgaacatcattagcaagcttgatcaagtcaa 

tcaggtgattggaattgatttattatcccaatctttgttgccagctattgttgagctagcag 

aggacaggcattggcgagtccgtcttgcaataatagaatacatacctctattggcaagtcaa 

ttgggcataggattttttgatgataagcttggtgccctttgtatgcaatggttacaggacaa 

ggtttattcaatcagagatgctgctgctaataacctaaagcgtcttgcagaagaatttggtc 

cagagtgggcaatgcagcatataattcctcaggtcttggatatgactaccagtccacattat 

ttgtatagaatgacaattcttagagcaatttcattgcttgcacctgtaatgggctctgaaat 

aacttgttctaaattgctgcctgtggttattactgcaacaaaggatagagtgcccaacatta 

aatttaatgtggcaaaggtgttgcaatcccttatacctattgttgaccactcggtggtggag 

aaaaccattcgccctagtttagtagagctagctgaagaccctgatgttgatgttcgctttta 

tgccaatcaagcacttcagtcaattgataacgtcatgatgtcaggctagagaatataactgt 

ggtgagagtactacaaatctctcttcaaatccctctttggtaggattttgctctcacacgaa 

gacgcaaaagagaaaatgtgcaagcaaaatgcattctgttgagcttggagtcgtatattgtt 

actaattcttttgtaggatttgacattcaagatgctgtgacactaatgaacaccgagtgttt 

tttcatgtaaagttactgccgtactatttagatctgctaagctcatgtatttgcttttgtta 

gtgtacttttttggtgtttgaacttacaactttttacctgcgttattctagcagatttgttg 

cgtttacattagcgtttgcgtttcttcctagccgatgttatgtttgagcagtgcccccgcca 

ccctctctttttctcaggtcttatgctttctatgtgttttttcatgccgatagaatgtatgt 

ggaacttttagtacttattattttttatgttgtatttgttggcttgagatgagcaacataaa 

taataagaaactggg 
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SEQIDN031 

ctgtacaaaaaagcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtcc 

gaggagattgagctgagctgactcaatgtttccgagattgattcaaccacaaggggaagatg 

aatataatatgaatgttgggattcatcatactcataatattaatggagatccttgccttgtg 

ctgacgtcagatccgaagcctcgacttcgttggactgctgacct'tcacgaacgcttcgttga 

tgctgttactcagcttggcggtcccagcaaagctacgccaaaggcgataatgcggacaatgg 

gtgtcaagggactgaccctcttccacctaaagagtcaccttcagaaatacagactaggactt 

acagctacatattcattagagagcccttgttctggtggtactcctcagcagttgccggcatc 

ggacttgaatgaaggttatgaagtcaaggaggcattgagagctcagatggaagtgcaaagta 

aattgcacctgcaagttgaagctgagaagcacttgcaaattcggcaggatgctgaacaaagg 

tatattagcatgctggagaaggcctgtaaaatgcttgctgatcaattcattggtggtgtagt 

tactgaaaatgatcaagagacttgccaaggattaggaacaaggacacaagttagccctcttt 

gtaatccacttggattatgcccctcggaatctgctgatcttgttggaatccatggtccagaa 

gaagtttcccccagaatccatccacaattcaccgattgttccactgaaagctgcttaacttc 

gcatgagagtcctgctggacttcccctagaaggaacttcacctggaggaagaaaacgagggc 

cgaatggagattcaacacatgcatcagttgttggggtgaagcagatatgatatcgtcaggtg 

ttcgtctgcttcaagttaatcgctttgggattactagctctaatgttcaaaatgtctcttct 

taagagattagtgctgagtttatctacagccattgattctcaaactgcatattgcggtttct 

gggaatactgatgggccttggacttgtcaagttgtaaatgcaagctgatgactttctaactc 

taactgcgccccctgaacattaaatcctaaaaaaggaagaaaattgagatgcgag 

SEQIDN032 

agcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgaaagaagaga 
aaaagatgggtgctgacaaagggaagaagcaaaaagtggaggaagagaacaacaccattgat 
ggtgagctcgttttttccattgaaaaattgcaagaaatacaagacgagctcgagaagatcaa 
tgaggaagcaagtgataaagtattggaagtggaacagaagtacaatgagatccgcaagcctg 
tctatgacaaacgaaacgacatcattaaagctatcccggacttctggttgactgcttttttg 
agtcatcctgtcctaggtgaacttctaactgaagaagaccaaaagatcttcaagtttctaag 
ttctattgaagttgaagactctaaagatgtgaagtcgggctactcgataacctttaacttca 
atgcgaatccttattttgaaaatacaaagctcacaaagacctataccttccttgaagatgga 
cccacaaagatttctgctacaacaataaaatggaaagaaggcatgggcattcctaatggatt 
tgcacatgagaagaaaggaaacaagcgatctcatgctgaggaaagcttcttcacatggttca 
gtgaagtcaatcaaaaagatgaggatgaggatgaggccctagagattcaggatgaggtcgct 
gacataattaaggatgacttgtggccgaaccctctcacctattttaacaacgagcctgatga 
agaagattttgatggtgacgagggaaaggacagtgaaggctctgaagacgaagaggaagaag 
aagaggaggatgaggatggtgatgaagaatgaaggcagtaaactgttcaagacccctatttt 
gggatctcgtcttcagcggttttaatcatcagggtttaatgtctgtaaagaggctttgaatg 
ttgccaaagaacagaataactgtggtgactataccttttcttctcttgtatggttataactt 
ataagcaaaatatctaattccggaggttccaaaatgttttcattaggctagttcgattaatg 
aagtgtttgtctggcaaaaactgataatgttaggttattgagttatg 
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SEQIDN033 

ccacgcgtccgcccacgcgtccgggcagctcatttttaccgccgtaacaaaaactcgagaga 
atggcaatggtagatgagccattgtacccaatagccgtgttaatagacgaacttaagaacga 
cgatatacaattgaggttgaattcaattaggaggttatcgactattgcacgtgcactcggtg 
aggaaagaactcgaaaggaattgatcccctttttgagtgaaaacaatgatgatgatgatgag 
gtgttattggcaatggctgaagagcttggtgtgtttattccttatgttggaggtgtagagca 
tgctcatgtcttgctcccgcctttggagacgctttgtactgttgaggagacttgtgtgaggg 
ataaggcggtggaatcgttttgtagaattggatctcagatgagggagagtgatttggttgat 
tggtttgtccctctcgtgaagaggcttgcagccggtgaatggttcactgctagggtttcagc 
ttgtggactctttcatattgcttactcaagtgccccagagatgttgaaggcagaacttcggt 
cgatttacagtcaattgtgtcaagacgacatgcctatggtgcgaaggtcggctgcgacaaac 
ttggggaagtttgctgctaccgttgaatctgcttacctcaagagtgatatcatgtcaatatt 
tgatgatcttacacaggatgatcacgattctgtacgcttattagctgttgagggctgtgctg 
cacttggcaagctgctggaaccacaggactgtgtggcacatatcctgcctgtcattgtcaac 
ttctctcaggacaagtcttggcgcgtgcgatacatggttgctaaccagttgtatgaactatg 
tgaagctgtagggcctgagcccactaqgacggatttggtgcctgcctatgtccgtttgcttc 
gaga taatgaagctgaagttcgcatagct get gcaggaaaagtcaccaaattctgtcggatt 
cttagtcccgagctagctattcagcatattcttccctgtgtgaaggaattatcatcagactc 
ttcacagcatgtcagatctgctttggcttctgttataatggggatggctcctgttttgggaa 
aggatgcaaccattgaacatcttcttccaatatttctttcccttctgaaggacgagtttcct 
gatgtgcgcttgaacatcattagcaagcttgatcaagtcaatcaggtgattgggattgattt 
attatcccaatctctattaccagctattgttgagctggcagaggacaggcattggcg^agtcc 
gtcttgcaataatagaatacatacccctgttggcaagtcaattgggcataggattttttgat 
gataagcttggtgctctttgtatgcaatggttacaggacaaggtttattcaatcagagatgc 
tgctgctaataacttaaagcgtcttgcagaggaatttggtccagagtgggcaatgcagcata 
taattcctcaggtcttagatatgactaccagtcctcattatttatatcgaatgactattctt 
agagcaatttcattgcttgcacctgtgatgggctctgagataacttgttccaagttgctgcc 
tgtggttattcatgctacaaaggatagagtgcccaacattaaatttaatgtggcaaaggtgt 
tgcaatcccttatacctattgttgaccactcggtggtggagaaaaccattcgccctagttta 
gtagagctagctgaagaccctgatgttgatgttcgcttttatgccaatcaagcacttcagtc 
aattgataacgtcatgatgtcaggctagagaatataactttggtgagagtactagaaatctc 
tcctcaaatcctctttgatagtcttgggattttgctctcacacgaagacacaagggaaaatg 
tgcaagcaaaatgcattctgttgagcttggagtcgtatattgttactaattcttttgtagga 
tttgacattcaagatgctgtgacactaatgaagaccgagtgtttttaaatgtaaagttgctt 
ctgcactatttagatctgctaagctcatgtatttgtttttgttagtgtacttttttggtgtt 
tgaacttcccacgttttctgcg 1 
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tttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgt 

ccgagaaattagcagttagagacactgagaagcagcagctctcttcctcagctgctgtgtgc 

ttaggcaaagaataaaatgggggcagacaaagggaagaagcaqaaagtggatgaggaaaaca 

acaatgttattgatgaaaagctcattttttccattgaaaaattgcaagagatacaagacgag 

ctcgagaagatcaatgaaaaagcaagcgacgaagtgttggaagtagaacagaagtacaacga 

gatccgcaagcctgtctacgataagcgaaatgatgtcattagctctatttctgacttctggt 

tgactgcttttttgagtcatcctgttcttggtaaccttctcactgaagaggaccaaaagatt 

ttcaaatttgtaagttctattgaagtggaagactcaaaggatgtgaaatcgggtcattcaat 

cacgtttaactttaagcccaatccttattttgaaaattcaaagctctcaaagacgtatacct 

tccttgaagatggacctacaaaaattacagctacaacaataaaatggaaagaaggcatgggc 

attcctaatggagttgctgacaagaagaaaggaaacaagcggtcccacgctgaagaaagttt 

ctttacatggttcagtgaagtcaatcaaaaaggtgatgtggatgatgacgaaaatgagattc 

tggacattcaggatgatgaggttgctgaaataatcaaggatgacttgtggcctaaccctctc 

aattattttgaccatgagcctgatgaagaagatattgagggcgatgagggaaaggacagcgg 

aggctctgaagaggaagaagaagaggaagatgatgaagatgaagaagacgaatgaactgttg 

gtagaccttgtgtttgatttgagttctcatcagtgtttcaatcatcagagttggtctctgta 

aagaggtttcggatattgcagaaaaattgaatgacatatagtggtgactctaatttttagtt 
tcagtga 



FIGURE 4 (continued) 
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SEQIDN035 

ccacgcgtccgatcttgaaaaagttccattcttttttctccttctgcttcttcttctgattg 

aagattctgaacctgttctaagtttatggataggtggaatacttcactgagtggatattaca 

actacccttttcgattcttgcatttttattccatttttgttgtgattgtgttttcttccatt 

ttccctattatatcagctgggaggaggtcagatggggttattgt'aactcaagctgattttca 

agcacttaaggccattaaacatgaactgattgattttagaggaatcttgacaagttggaatg 

atagtggtttaggagcttgtgctggtggatggataggtataaagtgtgttaatggggaagtt 

atagctatacagttgccttggaagggattaggtggcacaatctctgaaaaaattggtcaatt 

acaagctcttagaaagcttagtattcatgacaatgttattgctggtcttgttccaacttcgt 

tgagtttccttccaaatcttagaggtgtttatcttttcaataaccggctttcgggttcaatc 

ccaccaaccattggcagatctccacttcttcagactcttgatcttagcaacaatcagctcac 

tggtactatccctcctagtcttgcaaattccacaaggttatacagactcaacttgagctaca 

atgcactttcaggttcaatcccagtaagttttactcaatccccttctcttacttttcttgca 

cttgaacataacaatctttctggctctattcctgatacttggggtaatgttgttgtgaacga 

taagtcttatcaacttcagtatcttacccttgatcacaatcttttatatgggaaaattccag 

cttcaattagcaagttaagtatgcttgaggagattaatcttagtcataaccaaattaatggg 

actattcctgatgaattaagtgcacttacaaggcttgctcttcttgatttatctaataattc 

cataaatggaactattcctgttagtttctccaatctttcagctcttgttactttgaatttaa 

agagcaatcttttggataaccaaatcccagatgtcatatatagattgcaaaatctttcagtg 

ttggatttgagtaagaataagctcactggccatattcctgccaccattgggaatatttctag 

gctcaactcacttgatttatctgaaaacaatttcactggtgaaatcccaaactctcttgttt 

ctttggcaaatttgacttctttggatgtctcttataacaatctttctggggttgtcccatct 

cttctttctaagaagttcaattcaagtgcttttgttggaaatctagagctatgtggatatag 

tccctcaactctatgtgcttcaccacctcctcaaactcttcctccttctcctattggtgggg 

ttgccaagcctcgccatcgcaaacttagtactaaggatatcattctcatagcatctggagca 

cttctagttgttctacttcttttgtgttgcatgctactttgctgcttgattaggaaaaaggc 

aaattcgaaagcaaaaaatggtagtaaagccagtggcttagctaccacagggagaggtgcaa 

agccggttccagcagcagcagcaggtgctgaagttgaatcaacaggtggaaagctagtccat 

ttcgatggaccattcgtgttcacagcggacgacttgttatgtgccactgcagagataatggg 

aaagaacacttatggaacagcatataaggctacattagaggatggtaatcaagttgctgtga 

agaggctgcgcgagaagatcacaaaagggcaaaaagagtttgaagctgaagttgctgaatta 

ggcaagattcgacacccaaatattttggctcttagagcctattacttgggacctaaaggaga 

gaagcttcttgtctatgactacatgcctaatggaagtctctcatccttcctccatgctcgag 

gtcctgagacaacaatagactggcctacaaggatgaggattgctattggcataacaaaaggc 

atctgctttttgcataccaaagaaaacataatacatgggaatcttacatcaagcaacatact 

acttgatgagctaaacaacccaaagattgcagatgtaggcctttctaggcttatgacaagtg 

ctggaaacaccaatgtgattgccactgcaggcacactaggttatcgtgcaccagagctttca 

aagatcaagaatgcaagtactaagaccgatgtctatagtgttggagttatcattttggagct 

cttaactggaaagtcacctagcgaggcaacagatggactcgatttgccacagtgggttgctt 

ccattgtaaaagaggagtggactaatgaagtgtttgatgtggaacttatgagggatgcccct 

aatattggtgatgaattgcttaatactttgaaactagctttgcattgtgttgatccaacacc 

aactgctcggcctgaagctgagcaagtacttcagaaattggaggagattaaaccagagctga 

tgttagcaccccccagttctggaaatgatggcgctgcagttcaagaaaaaaatgaataaact 

cagtaaggtttgattgctaaaagtgtattgaaaaaggtttaggagttccagcttttttactt 

gattgacacccacctatttattctttcatttttttttttgatccagtggagtgagttgttgt 

ctcctattagttctattagtaaactgtatatccgagcttctgattgctgcatagatgcaaaa 

cgcattttgttcaattccctctattctttgcaatgtaatgcaataatagtatctatcttttt 

gatgacatcaacacacgccacgtg 
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ccacgcgtccgccgtgatgtaatcttggtgatgctacttattcccttttcccttcttgagcc 

caaactcaagaaggtcaaaaacaaaaaaattacaaaaagctggaatcttgcagtttttttat 

ttaatttatttatcctatgttgaattaatttttggggtcaatatttcccaatttgtagtctc 

caatggagcctcgtgttggtaataagttccggcttggccggaaaatcggtagcggttctttt 

ggagagatctatctcggcgctaatgttcaaactaacgaagaggttgcaattaagctggaaaa 

tgtgaaaacaaagcatcctcaactattatacgaagcaaagttgtataaaatactacaaggag 

gaactggaatccccaatttaaaatggtttggagttgaaggagattataatgcccttgtgatg 

gatttgctggggcctagtcttgaagatctcttcaacttctgcagtaggaagctgtctttaaa 

gaccgttctcatgctcgcagatcagatgattaatcgggttgaatttgttcatgccaaatctt 

ttcttcatcgagatataaaacctgacaactttcttatgggattaggaagacgtgcaaatcag 

gtctatatgattgattttgggctggccaagaagtatagagactcatcaactcatcagcatat 

tccgtatagagaaaacaaaaatttgacaggaactgctagatacgcaagcatgaatactcatc 

ttggcattgaacaaagtcgaagggatgatttggaatcgctgggttatgttttaatgtacttc 

ttaagaggaagtctcccttggcaggggctgaaagcaggcactaagaaacagaagtatgagaa 

gatcagtgagaagaaagtatcaacatcaatagagaccttgtgtaggggatatcctgcagagt 

ttgcatcatattttcattactgtcgatcactaagatttgatgataaaccagattatgcttat 

ctgaagagaattttccgtgatcttttcattcgtgaagggtttcaatttgattatatatttga 

ctggaccattttgaaatatcagcaatcacagcttgccaatcctccatctcgtgctcttggtg 

gtactgctgggccaagctcagggatgcctcatgctcttgttaatgttgagaggcaatcaggt 

ggagatgaaggtcgaccaactggttggtcttcatcaaatcttacacgtaataagagcacggg 

gctgcatttcaattctggaagcttattgaagcaaaaaggcacagttgctaatgatttatcca 

tgggtaaagagttatccagttctaattttttccggtcaagtggaccattgaggcgtccagtt 

gtctctagcatccgagacccagtgattgcagggggtgaacctgacccctccggcactctgac 

aaaagatgcaagcccgggaccattgcgtaaagtatccagtgctgcacggaggagttcaccag 

ttgtgtcctcagatcacaagcgcagctcctctatcaaaaatgccaacataaagaatttagag 

tccaccgtcaagggaatagagggtttaagttttcgatgatgagggactgcattagtagctgt 

gctttgtctcagttctccgttcactgtaaattttggcacaccaacttggggagtaagagttc 

tgatattagttgctgtcaggaagtaccataaagctgaattatacaattaaaatttgggatcc 

aatcgcaaaagcacattaaggatatgatggggttgcaqatccaaactcacagattccagttt 

atgctcgtccatacagttataggcactttccatattcttttctttaatctctgtctcttgct 

tgttattgttatgtcgtggtattcttgttgaggtcatgtttgtgaattgcgaagatggtcat 

gtataattgccgagaaatcatgtactagtttgttttaaacatgagcaaactgttattttgtt 
caagctactttaatatcaaaaaaaaaaaaaaaag 

SEQIDN037 

Ccacgcgtccgcccacgcgtccggaagaagaagcctgctgccatggcttccgagaaagaagc 




gctccggtgcacattctgcttattcccaaggtcagggatggcttgactggactgtccaaggc 
tgaagaaaagcattgtgaaattcttggtcaacttctttacaccgcaaagcttgttgctaaac 
aagaaggtctgctcgagaatgggttcagacttgtgatcaatgatgggcctagtggatgccaa 
tctgtttatcatcttcaccttcaccttctcgggggacgacagatgaactggccacccggcta 
aaggaagccgagatgaattccagatctcatggagtatccagacttcatccgatcatctatgt 
gtagcacttactgaaaacactatcgtctatgtgtagcgtttgaagaatcaagctctaagctc 
gtcctatgctcctatggagtgacaaataggactcattccgactattatattgatcatcaata 
agagggatttctctgaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 



FIGURE 4 (continued) 
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ccacgcgtccgcaatattttgactaatacactgttctgttcttcacctaattcttttcttct 
tctaataataacgtgctgctaagtcctaaagctcctctttggagctccaattaactccaaac 
taccaaaatccagagcaggtttaacaaagaatgggggttggaggacgtgaagtggcgatttc 
attggatggagtgagggacaagaatatgatgcaattgaagaaaatcaatactgcaattttcc 
cagttcgctacaacgataagtattacactgatgccattgcctctggtgatttcaccaagcta 
gcatattacagtgatatttgtgtaggttcaattgcatgtcgccttgagaagaaggaagctgg 
ggctgttcgtgtttacataatgactctgggtgttttggctccatatcgtgggctaggtattg 
gcaaaatgttgttgaaccatatcctcaatctttctgccaagcagaacgttagcgagatttat 
ttgcatgtgcacacaattaacgaagatgccctcaatttctataagaaatttggatttgaggt 
tactgataaaatccaaaattattatacaaacataaccccaccagactgttttgtcctgacca 
agttcatcactcaaacgaagaaatagatagtctcagctactttgattgagccttggtcaaac 
cttcacattatctttgaggttctgagctttctggttctagtttttgctacttatgagtaatg 
tgccacccattggattgttagtgtaagcccttttctgttctatcttatcctatctgcaacaa 
catcaaagttgaatgatttccctgtaatagaaatagtgcagttcaatgcaaacattcgagtt 
tggttatgttagatcacg 

SEQIDN039 

gatatcgtcgacccacgcgtccgaaagagaagaaatattaaatagcacaagaaaaatggaga 
gtgctaatgcatattctacattgccaatggaaaatgttaacgatgttgggcttattaatttc 
atggacgaggctaactttgaacaatttattgagctcattaagggtgaaactgctgaccctat 
cgtgaagttttgccccaactatgactgtgaacacattacaggttgttttccttcgactga'tg 
tccaatttgagccaacaccaatggatatctttgattggaatgctacaaacatatctaatcct 
atttcacttttttcttccctccccggagaaatgaagctccgggaagaagaagaggaggagga 
agacgacaatgattacgaggaatcttctgggacaacaactaccaccaccatgactatgttgc 
cggcaacgccaacaaagaagagcacgaggactgaccgatcaagaactttaatttccgagcga 
aaaagaagaggaagaatgaaagagaagctttatgccttgcgttccttagttcctaatataac 
aaagatggataaagcctccattataggagatgcaatactatatgtacaaggactgcaaacgc 
aagcaaagttactaaaggcagaaatagcaggtcttgagtcttcctcaaatgaaatgaacaat 
aatccatttcagaataccaagcaaatgaaattgatgactcattatcctgcaatcaagaggat 
atcgaagatggacatttttcaagtaggagaaagaagcttttacgtgagattagtatgcaaca 
aagggcgacaagttgctggttctcttttcaaagctcttgagtctctttctggattcaatgtt 
caaagctccaacttggctacttctgccgatgattatattttgacgttcactcttaatgtgag 
cgaatgtgaggtagacatgaacttggccaatttgaagctatggatagctagtgcttttctta 
atcaagggtttgacttcgagacattaccattggcctaacgtttcattattgtaattgtgcag 
agttttaaccggtcaaagaatgagaaatgtcattatttatcggtcgtcatttgtaacttttg 
attatttagagtcacgtattctaaaagagtaaagtttgtcaaattgcaatggcgcgcatcgc 
actgtgtacatgtgaccgacctaattgtttattacggttgactttgttactactacttttgg 
aatcaaaacagtcatggcgggcgcg 
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ccacgcgtccgctttccacattctctcaactttctctttctaaaactcttcctctttttcta 

gcacacagaccttcaatggcatcgccgcgcgaggagaacgtgtacctggcgaagcttgctga 

gcaagccgaacgctacgaggagatggtagagttcatggagaaagtcgtcggcgccggcgacg 

acgaactcaccgtcgaggaacgcaacctcctctccgtcgcgtacaaaaatgtgatcggagcg 

aggcgagcgtcgtggcgcataatctcatcgatcgagcagaaagaagagagtcgcggtaacga 

agatcacgtggcctccattaaaacctacagatctaagatcgaatctgaattgacttcgatct 

gcaatggtatccttaagttgctcgattcaaaactcattggcaccgctgctaccggtgactct 

aaggttttctatttgaaaatgaagggagattattacaggtacttggctgagttcaaaaccgg 

agctgagagaaaagaagccgccgagaatactctttcggcttacaagtcggctcaggatattg 

ctaatgttgaattagcccctacacatccaatccgattggggctagctctcaatttctcagtg 

ttttactatgagatattgaactctcctgaccgtgcttgtaatctcgccaaacaggcatttga 

tgaggctattgcggagcttgacaccctgggagaggagtcctacaaggatagcactttgatta 

tgcaacttcttcgtgataacctcactttgtggacctcggatatgcaggatgatggaactgat 

gagatcaaagaaccatcaaaagcggaggagcagcagtaatgtgagtgaagcctctttgctta 

ggattgaatcctatggcataactttgctcattgatcgaaatttgctgtttgtgtagttctga 

attccctgaattgtaatacctaaaagcactgtttcttgccatttgttgttttcagcaaagat 

tactttttctctcggtatttcccttgtatttggatgctccagtgaaactctcttatttcgtg 
gaaatgaatgcttg 

SEQIDN041 

ccacgcgtccgcccacgcgtccgctccatgtttcatttactttggagttggttgctaaaaca 
gattaaagctagctgctaagctagtactgttagagttttgttaattagaagaaactaaagag 
tcaaaaacagtggatccaaggcatggaaagaggggacttttcatccaatgaaatggaaatgg 
aagagaaagagaataacgataatattgatgatcctcaacttcaagaggagctctataatata 
tactcagctcgatctcagcatgacatgtctgctatggtttctgtcctttctcaagttattgg 
aaacagtaccacccattcttcttctgctaatgctactccattaaccctacctcaatctgctg 
tagctctccaaaaccaatctcaatctattgaggatcaagggaattcgagaagaaaaaggtat 
agaggagtgaggcaaagaccatggggaaagtgggcagccgaaatccgagacccaaagaaagc 
agctagagtatggcttggcacttttgaaactgctgaggctgcagcaattgcctacgatgaag 
cagctctcagattcaaaggcaacaaagccaaactcaacttccctgaaagagttcaaggcaaa 
ttccaataccttactactactactagtcaaaatcatcacttgcctgataatattgttcaaca 
acaatatattccaactagctccaataataatcatcctctcccttgtcaagaacattatccta 
gtttacatcactatgctcagctacttcagagtgacagcaatattactgatttaaacttcggt 
atctcgccaagttataatcagcagttatctgcttcttttgattttgcgcaatcatcatctaa 
cagtacattatcggaattgccagcttcttatgagcagaggcaattacaatcaagttacaagc 
aagaagaagaagttttaatgagattttcatcgcattttggtactacttcaagctcatctgga 
cctcatgaaagtaactgggaagagtttgaagatagaaagtcataagttcattccctagtatt 
aagagatacgaagactgaaagaagttttatgagatttccgtcgcattttatttcgtagttta 
tggttttactggggttttctgtcctctgatcttgtatttcagttaagtgtaatagtagaact 
atatatattcatgaattaatggaaaaatattggtgtggttttatgtgtttaaaaaaaaaaa 



FIGURE 4 (continued) 
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SEQIDN042 

ccacgcgtccgcccacgcgtccgcccacgcgtccgatttgcttttccctcctcttctcctcc 
attttcctttaatgtcactaaaacagaagaaggggggaacagaaaagagttagaaaatgata 
ggagggaataatagttttgggaagacaatttgctcaatctgttatgaagatctaaatcctat 
tattgaagacctccaatccatttctatttgcggtcacgtttttcacgagatttgtcttcagc 
aatggtttgaatactgtacaaatggaaaaaagaagaattgtccagtttgcaaacaggcttgt 
tcagaacaaaatgcaaataggctttatttccaatcagttggtgatccaaatgatacaagtct 
gaccaagaaaccccgtgatcatgaagaggatccacgtgaactgagaaatgaggtcaaaagat 
tggaggggaaagttttacagttgacttctactttggagaaacagctgaaagatctcaaagaa 
gtcaatgcagagcttttcacatgcaaggaagagttgaaaatagaagcgactctaaagaatga 
agctgtgaaacaggaggcagccattcagcagttgttacatcttaaatccaaggagct agate 
gatcaactttggagtgcataaggctaaaagatggaaatatggctctagatagggagcttgca 
gcactcaagttgagttacaaagaactagtgaccaagtgcgatactcatggaaggcgagaggc 
tcgttctcttaggaaacttgagaagtcaaaagaaaagataaataagttgaagaccagggtcc 
aagaacttgagacggcacttgaaagaaaagaaaaagataatgaaaatttgagaactttgaga 
gctgccaagaaaaactttgagttgtatcaaggaagcaaagaacccaaagttgaccgacgttc 
atafgagaatcagaataaggcacctgctgcgacagaagtagatttatgcatagtcactggct 
catgcaatgatttatctagaccaaggagaaaaagaaagtctaagtctaaggaaaagagtata 
caaaacacggcagaagatattataactggtggaagtcaagtgcagggatcagaaaataagga 
tggaatctcaggttcaaggaattcccctgttattattcttgacgatgatactgatcttccgc 
ttctagatgatgttacacagcatcagccctcgtttcgcatcaggaaagagacttctgcacca 
gttatacttgcccatccaggagatacctgtttttctggtggattattaggtcctgatggtac 
ttactggcacttgggaaaatggtgcaagaaggttaaggacaagggatctggatcactgtctt 
gaggactgcaaggatcaggtgtgactgctgttgatttgattgctgtaggagctgacggtaga 
ggcggttggatcaaagttctgcgatcaatgaatccgggatcattgcaggacaaaaataagag 
tgtcatcagtcaagagatacaagtatgacatgaaatcaagtagttcccagtctcaaggatgc 
ttgcatacagatagcttcttcagaagaaccagtggataaccttattaacagtgctgcttcta 
cattaccaactgtagataatagagattaaattccttatacattgtttaggggtttaattttt 
agcaatctagttatactaccatttgattgaatggtccgaaaagaagaaacttaatgtcttct 
tttgagcatgtaaagtagggattcaaaggggaaaggtagcatacaggggagagaaaagcaag 
aaagcaacttcagcaattgtttcttagcggttttcagttatgttgcttgctcaatccatatt 
gaaagtatactcttggtaaaaaaaaaaaaaaag 



mryt idc a /_ 



a: 
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ccacgcgtccgatcaaatcttggcaactatggcttccttattatcggtgccattgtctctat 

catcctcatcaatgcaattaagactatcatcaaggatagactccattacttcaacaaccaag 

ctagtgaaggaggagagccaagtagagctcctcagaaagttttaccaattgagatgtccttg 

gatgagctaaatatactaactgataactttgatgagaaagctct'gattggaaagggatctta 

tggctgcgtttttggtgctaaattaagcaa'tgaccaacaagtagcaataaagaaattggata 

ctagttcttcaccagaaccagattccgactttgcagatcagttagcaatggtttcaagactt 

aagcatgagcattttgtgactctaatgggttattgcgtggaagcaaacaatcgaatcttggt 

ttatgagtttgcaaccatgggcacgttgcatgatgtattacatggtagaaagggagtacaag 

gtgctaagcctggtctacttcttacctggaatcagagagttaaaattgcttgtggtgtagct 

agtggcctcgaatatctacatgaaaaagttgaacctccaattattcatcacgatgttagatc 

tagcaatgtactactctttgatgatttcacagcaaagattgctgatttcaacttgacaacct 

ctgaatcttcagaggactttggctaccatggtccagagtatgccatggaagaagagataaca 

aagaaaagtgatgtttatagttttggagttattctattggagctgttgacaggaaggaagcc 

aatattagattataaagggcaacagagtcttgttgcatgggcaactcccctattaagtgaag 

ataaagtgaaggagtgtgttgatcccaacctaaataatgactaccctccaaaggcaattgcc 

aaggtggctgctcttgtagaactttgtgttcaatatgaggcagatttccggccaaacatgtc 

aattatggtcagggcactgaagccacttctcaatgcaaattaaactagaacctcaagcatac 

aacattattcttaattgaaacaagctgcagggacagttttaagtaccaacagggctgtgacc 

caagtcatggttccgttgccaatccaaggaaaaaggaaatcttgctaaaggactattggaat 

gttgttttcttaactgatttgtttaagattaaatatattatgttctcattttaaaaaaaaaa 

SEQIDN04 4 

Ccacgcgtccggaatcgacaatgaaattcagccgcgcattcaatgccgcttcagttctctta 
gttcttcttcttattaccatagttacggctaagaagtccggtgatgttacggaattgcagat 
c ggtgtgaagtttaagccaaaatcttgcgaacttaaggctcacaagggtgatagagtctcag 

tacactacagtggaaaacttacagatggaactgtatttgactccagctatgagaggaatgac 

cccattgagtttgagcttggaagtggtcaagtgattaaaggttgggatcaaggacttcttgg 

aatgtgcgtgggagagaagcgaaagttgaagatccctgctaaacttggttatggcgagagtg 

gatctccaccaaagattccaggtggtgctacacttgtcttcgacactgagctggttgctgtg 

aatggaaagagatcagcagctgatagtgaactgtgatttaacgatgtctctacactcttcat 

tagcgacttctaaatctatttttaggttatcttatagttatatttgcttctttttgataatt 

tagatactaaagtattggctgctggcaaaatgacacctcaagtgtgtttccttttgtcacta 

gttttttcctctgctaaagttaagtggatggacgatgaactcccaagatggttttgccatca 
ttacttttaaaaaaaaaaaaaaag 



FIGURE 4 (continued) 
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SEQIDN045 

ccacgcgtccgcgaaccaaaacttccagaacccaatatttacaacctgtcttccgccaccat 
tgacaagaaccttaaaaagcttgcttccaagtaacaataagagaaggttgattggaaaatac 
atttgattatttttattagagaatggcggggactaaggaacagataatggatgttcggtctg 
tggtggaagcagtaacagccgccggagatgatgttgagattgatactcctctttatgttgtt 
gaaagcctctgtatgcgctgtggtgaaaatggtacaacaaggtttctcttaacactgattcc 
acattttagaaagatattattgtcagcatttgactgcccacattgtagtgaaagaaataatg 
aagtggagttcgctggtgaaattcagcctcgaggatgttgctatggcttgcatattccatca 
ggtgatcaaaagatgctcgaccgaacggttgtcaaatctgaaagtgctaccatcaagatccc 
tgaactggattttgagatcccccccgaggctcagcgtggatcattgtcaacggtagaaggca 
tactggttcgagctgctgacggtttgggggcccttcaagatgaacggaagaaagtggatccc 
cagatggctgaagcaatagatcggttcttgataaaactgagagcttgtgcttcaggagattc 
atcctttactttcattcttgatgatcctgctggtaacagctttattgagaacccgttagctc 
catctcctgatccctcattgaaaatcacattctatgagcgaactcctgagcaacaggcagct 
ttagggtatcttgccgacccatcacagcttggaggacaaagtgatgaggtatcaagtgaggg 
tataaataatgttcctcatcacctgctaaaggaaccacatggatcagttggagcaagagcag 
gacgtcaggctattgctcagggtaacagtgcagaaatagctgaagctctatttcgatattca 
gctcctgaagaggtgatgatgtttccatcaacttgtggagcatgcgctgcgaggtgtgactg 
tagaatgtttgttaccaatattccatactttcaagaagtaatagttatggcatcctcttgtg 
atgcttgtggttatcgcaactctgagctgaaacctggtggtcctatatctgataagggaaag 
aaaattacccttcatgtggaaaacattgaagacttaagccgtgatgtgattaagtctgatag 
tgctggagtggaaattcctgagcttgagttagagcttgctagtggcactttgggtggaatgg 
tgacgacagttgagggtttaatcacaaaaattaacgaaagtcttgagagagtacatggattc 
acatttggagacagtcttgatgaagacaggaagagcaagtggttggacttccgagcaagact 
agacaagcttttgagcttgggacaaccgtggacattgatcatcgacgatgcactttcaaatt 
cttttgttgcacctgcaaccgatgatatcaaggatgacaaacagttaacatttgaagattac 
gtaagatcgtgggagcaaaatgaggagctgggtcttaatgacatggacaccacctcagctga 
t get gcttacagttcagca gat get gcacccagt gaga aagctgacgattgatgaatttatg 
cttagtgattttctttcatactgctttggccttaaaatctaaggtaagcgttgattgttctt 
tcatatgactgtagaagagatctagaaccataaaagattgccaacgcctgcagccatgtcta 
catagtggccttgtgactagaactcctttaaatagagagacaaacattttaattagctatac 
gggttcctttaatcaaacacttcagagttattaacaatgcaatttgttttagaagatagttt 
gcaatgcaatgatttttgacttgtaaaaaatatcaatc 
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ccacgcgtccggtggatttgtgtgcagcaccaggcagctggagtcaggttttaagccggaag 
ttatatctcccagcaaagttgtcatctgataccaaggacggcgatctaccacttattgtggc 
tattgacttacagcctatggctcccattgaaggtgttattcaagtacagggtgatataacaa 
atgctaaaacagctgaagtggttattagacattttgacggatgcaaggctgacattgttgtc 
tgtgatggtgcacctgatgtaacgggacttcatgacatggatgaatttgttcagtcccagct 
gatattggcgggcctaaccattgtcactcacatactaaaaggaggcggaaagttcatagcaa 
aaatttttcgaggaaaagacacaagccttctttactgtcagctaaaactatttttcacagaa 
gtgacttttgctaagccaaaaagcagtcggaattctagcatagaggcatttgcagtttgcga 
gaattactctccacctgaaggatttaatgagaaagatcttcatcgccttcttgaaaagattg 
gaagtccatctggcacagaggacctagattgcagtagtgcatggctggaaggtcctaataag 
gtgtatattccatttctggcttgtggagaccttagcgggtacgattcagaccgttcatatcc 
acttcctaaagctgcagatggaacctatcagtgtttagatcctgtacaacctccaattgcac 
cgccatataaacgagctcttgaaatgaagaaagcgtcgaatcaaggaatccaaaacctagac 
aagctttctcttagctcctaatcttaccatccagaattattccattctgtgacattggaaag 
tcgcttatacgtcaccaaatgtaaggactttttattgttaactgcacttgcaattaatgaat 
ttaatgtgttttgttaaaaaaaaaaaaaaagg 

SEQIDN04 7 

ccacgcgtccgtgcagaaatggcgactcgttatctgactcgttccttattcactgctctctc 
tcgctcatacacttctctttctctctctacacctcctccctctctctcttctttttctctcc 
tccgtcttcgaccgcttattgccgtcacagctgtcaacctccgcagcgtctctccggcggta 
gcaacctcctttcgagggtttgcgactcggcaaacgtcgtcgtctttaaatgacccgaaccc 
gaactggtccaaccgtcccccgaaggagacgatcctacttgatggatgtgattttgagcact 
ggcttgttgtgatggagaaacctgagggtgatcctaccagagatgaaatcatcgatagctac 
atcaaaactctggctaaagttgttggaagcgaggaagaagcaagaatgaagatatactctgt 
ctcgacaagacattactatgcatttggagctcttgtatccgaagaacttcattacaagctaa 
aagaactgcccagggttagctgggtgcttcctgattccttcctggatgttaagaataaagat 
tacggaggggaaccttttatcaatgggcaggctgtaccatacgacccaaagtaccacgagga 
gtgggtaagaaacaatgcccgagctaatgagaggaacaggcgcaatgaccgacctcgtaact 
ttgataggtccagaaattttgagagaagaagagagatgcagaacactggatccaacatgggt 
ggtggacctcccaatatgacgaatgcgccaaccccaaacatgggtgggatgcagcagcctaa 
catgggtgggaggcatcagcctagcatgggaggaccacagcagcccaatatgggaggtgcac 
ctcataactacggtggagcgcctcccaataactatggtggagcgccaaacaatccaaataat 
tttcaatacaatagtggacaaagcaacggaggcatgccttaccaaacaggtccaggaccaaa 
ccagaattatgcttcaaatacatctggtggaaacccttatcagaatccaaacatgcctggaa 
gagatatgccccctccaaatcagaactatgctccgaatacggctggtggaaacccttatcag 
aatcaaaacatgcctggaagagatatgccccctcgaaattatcaataggctgatatagataa 
gtatgaactttgtatttccagagttctgtttcacgaaatgagaacatagctatggtgtgctt 
gataggatgttgctgcgtgtaatagttgaatgtgcaaaacttatatgctttgtgagtatgca 
atgtcaaggtgttctcatcctattgcatcctctatgttgacatgctctctgtcaattctcct 
gatgagtttactagcctgaccaagaatatgttatgctttaccatgttgaatgcttgaaattt 
cagggcctcattgcaggtactgttcaaaaaaaaaaaaagg 



WO 03/085115 



22/140 



PCT/EP03/03703 



SEQIDN04 8 

ccacgcgtccggaaatggcgactcggtatctgactcgttccttattcactgctctctctcgc 
ccatacacttgtctttctctctctacacctcctccggtctatcttttctttctctctcctcc 
gtcttcgaccacttatcgccggcgccgctgtcaacctccgcagcgtctctccggcgggagca 
acctcctttcgagggtttgcgactcggcaaacgtcggcgtcgtt'aaatgacccgaacccgaa 
ctggtccaatcggcccccgaaggagacgaccctacttgatggatgtgattttgagcactggc 
ttgttgtgatggagaaacctgagggcgatcctaccagagatgaaatcatcgatagctacatc 
aaaactctggctaaagttgttggaagcgaggaagaagcaagaatgaagatatactctgtctc 
gacaaggcattactatgcatttggagctcttgtatccgaagaactttcttacaagctaaaag 
aagtgccgaaggttagctgggtgcttcctgattcctacctggatgttaagaataaagattat 
ggaggggagccttttatcaatgggcaggctgtaccatacgacccaaagtaccatgaggagtg 
ggtaagaaacaatgcccgagctaatgagaagaacaggcgcaatgaccgacctcgtaactttg 
ataggtccagaaattttgagagaagaagagagatgcagaacactggacccaacatgggtggt 
ggacctcccaacatgacgaatgcgccacccccaaacatgggtgggatgcagcagcctaacat 
gggcgggaggcatcagcctagcatgggaggaccacagcagcccaacatgggaggtgcacctc 
ataactatggtggagcgcctcccaataactatggtgggtcgcctcccaataactacggtgga 
gcacctcccaattactatggtggagcgccaaacaatccaaccaattttcaatacaatggtgg 
accaaccaacggaggcatgccttaccaaacaagtccagggccaaatcagaattatgcttcaa 
atacatctggtggaaacccttatcagaatcaaagcatgcctggaagagatgtgccccctcca 
aatcagaactatgctccgaatacggctgacagaaccccttatcagaaccaaaacatgcctgg 
aagagatatgccccctcaaaattatcaataggccattgtatatgagtatgaactttgtattt 
ccagaattctatttcacgaaatagtaacagttgtagccgtc 

SEQIDN04 9 

ccacgcgtccgcaaaaccctaaactcttcaccttcaaacatcaaaatcctctcgcattctct 
ctagtaatggctaccgctaactcctcttctttctcacctgtatcttccccttcaaaccatgt 
tcccctaaagcgagtaggtactcacaatggtagcttccattgagatgaagctcttggttgct 
teat gattcgtcttacaaacaagttttacaat get cagattgtccgtactcgcgatacccag 
gtgttggaaacgcttgatgcgggtgcttgatgttggtggggtttatgatcctagtcgagacc 
gttatgatcatcaccaaaagggatttcaagaggtttttggacatggtttcactactaagctt 
agcagtgctggtcttgtttacaagcattttggaaaggagataattgcaaaggagctccaagt 
tgatgaagaacatccggatgttcataggttgttccttgccatttacaagagcttcatggagg 
caattgatgcagtcgacaatggaatcaatcagtacgatacagaccagtcacccagatatgta 
aataatactcatttgtcctcacgagttggaagactaaacttggactggattgaacctgatca 
gtcttctgaaaaggagaatgaagctttcgaacgtgcaatggatttagctggcagtgagttct 
tggatcgcgtccgctttcatgtaagatcttggttaccagcacgctcaatcatcatggagtgc 
cttgctgcaagacacaagattgatcctagtggagagattgtagtttttactacattttgccc 
gtggaagcttcatttgtttgagctggaagaggagatgaagattgatcctcccatcaaatatg 
ctttatatcaggatgataggagcaaaagttggcgagtgcaagctgtgggtgtagctcctgac 
agatttgagagcaggaaagcccttccagctcagtggcgaggtttaagagatgatgaactctc 
caaggaaacaggaattcctggctgtgtttttatccacatgagtgggtttattggaggaaatc 
aaagttatgaaggagcactcgcaatggcaaaagctgctttgaagctctaggcacaggaacag 
ttttataaatggatttcagaaactgagtgatctctttatgatttaacattatagctgatcat 
gacatcaggttgccatttaaatagcgcattggagttgaatttattcaaggttattaaggaaa 
ctatacacaaccaggcagacagttttttacatattcagatgctatcttttacttttac 
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agcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgtcttcttctt 

cttcttcttcttcttcttcttcttcttcttcttcttcttcttcttcaatttttctctctctc 

tttttctctagggtatacagaaatggggatcgcaacggagaatcaaccacagcaacaacaca 

aggcgtcaccagaggcatcatcagaggcagataaaaagaggtggatgcttaatgattttgac 

attgggaagcctcttggaagaggaaagttfggtcatgtatatctagctagggaaaaaaggag 

caatcacgttgtcgcattaaaagtgctgttcaagagccagctaaagcagtcccaggtcgaac 

atcagcttcgtcgtgaagttgagatacaaagccaccttcgtcatccaaatattttgaggctt 

tatggttacttttatgaccagaaacgtgtgtatttgatcctggaatatgctgccaagggtga 

actctacaaggagctgcagaaatgcaaatattttagtgaacggcgtgctgcaacttatgttg 

catccttagcccgagccctaatatactgtcatgggaagcacgtaatacacagagatatcaag 

ccggagaatcttttggttggtgcacagggtgaactcaaaattgcagactttgggtggtcagt 

gcatacctttaatcggaggcggactatgtgtggcactctagactatttgccaccagagatgg 

tggaaagtgtggagcatgacgcaagtgtggatatttggagcctgggtatcctctgctttgag 

tttctgtatgggatgcctccatttgaagcaaaggaacactcagacacatatcgaaggattgt 

gcaagtggatctcaaatttcctgccaaaccaattgtctcatcagctgccaaggaccttatta 

gtcagatgcttgtaaaggattcttctcagcgtctgcccctaaaaaaggtcctggagcatcct 

tggattgtgcagaatgcagatccttcaggtgtttataagggctgatgaagacatcaccaatg 

actcacaatctttgtggcggactaaattgtttttgtttttcactgaaaaagcctttgctcag 
cgtta 

SEQIDN051 

gtttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcg 
tccgcgggaagattctcatgcaattaaccgaatcgtcaaattttcctctaaaatataaagtt 
tctccggaaaatgtcattcatcgatgaatttcaagccaatatagaagctcttccgaaccatt 
tacggaggaaatatgccttattgcgtgatttagataaaagtctgcaaggagtccagaggcaa 
aatgagcaacgttgtgagaaagaaatagaggatatgatacagcgtattaaggctggtaacgt 
gacaccagactcttcactaatcaaattctctgatgatgcattggatgagcaaaagcatgcaa 
tccgaattgctgatgagaaagttgcattagcttctcaggcatatgatctggtagacgctcac 
attcagcagctcgatcagtacttgaaaaaatttgatgaagagctccggagagaaagagatgt 
tgctgttgttactggaactcctgctaccactgttgaaaataatggaaagtccggaaggtctg 
gtgaaggtaagggagggcgcaagaaaacacgtcttgctacagcagcggcagctacagccact 
gcagcagcagcagcaacaccaagtggaatggatttggatctacctgttgatccaaatgaacc 
aacatattgtttctgcaatcaagttagctatggtgaaatggttgcgtgcgacaatcctaatt 
gcaaaatagagtggttccactacggctgtgttggccttaaagaacagccaaagggaaaatgg 
ttttgcgcggattgtgcaggaacacaaaagaagcggaaaggcagatgatagtagtagaagaa 
aataattcagtatactgatttaagacgttttaccaccggaaaaatttatgtagatactgtac 
ttctgtaattttgttatgtgtagccattattaacaagtcactcttgcattctaattgtagga 

gggaagtacaataagtcaacaaaaaatttactcttgtttattatgaactataacgaacaaat 
aaactattgtcttttaccaatcaacatatttgtaatc 
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Ccacgcgtccgcccacgcgtccgtttcatattcttcttcttctgcttcattgttattgttta 
tagaaaaaaatccaagaatggagcagttgcaagagggttttcgttttcgtcctacagattca 
gaacgacttatgtttttgttgagattcattgctaaacaagagatgaatgattctggatttat 
cacaacaaacatcgacgtctatggcagagaagaaccctgggaaatttacaatcacggcgtat 
cctgtggtaatgaagataatgcggactacagcagtaactatcgctatttcattacaaagctg 
aagaagaaaaacaaggcgaggcataatctagaggttggaaataaagggagttggaaacaaca 
agataagggtaaatcagttcactacaaaaatacgggaaattcatcttctgtggttattggat 
gcaaaaagagcttgtgttacgtgaataaacatcagtgctataatcagagcgatggacattgg 
ctaatgaaggagtacgagctttctaatgttattcttcagaaattcgacgaagattgtagaga 
ttatgttctttgtgccatcaaaaggaagtcatgttctactgattatattgagcggccattgg 
caagggtgcagtatcaagtgaatgatttgggggactatatgcagagcaattcagggcattat 
gtggaatctgaaacggacatgacgacacagaacgaggtgcccgaattagaagttcttgatta 
tcaattagaagttcttgggatgaaaaggacttagctgatttaaattggatgttatatgatat 
gcctgtggtcgatcagacggtgaatattgtcgagcagcagaggaaccagaggtcagttatta 
ataagagtgatgaattctatcagatgttggcacaaaacgaagcttttgagttctattgatta 
actgtatagtcatattcttggtagatgatagagatttgattaacaatggcatatgtcccact 
ttgtagaatggaatttaagatagtagtacatctatatatctttgtataacagtatggcgcgc 
gcc 



WO 03/085115 



25/140 



PCT/EP03/03703 



SEQIDN053 

ccacgcgtccgtctatctaagcaatttccgtagttccaaacacggtctaaatcagattcctt 
gcttttttcatctcaaattctctactttttgttgcttcgcaacttggcttagatccttcttc 
atcacgctttgtaactgcttcactcaagctatttcacgatgcgatcaatgcatagatagctg 
aaaattcgaagtgcccggaagaaatggagaaggagtcacatggattaattataggcatctca 
attggagttgtgataggagtgcttttagctatacttgcatttttctgctttaggtaccatag 
gaaacgtcctcagatagggaatagcagttctaggagggctgccactattcctattcgtgcaa 
atggtgctgatacttgtacagtattatcagactcttccattggtacagagtcaccaaaatcg 
actatccagaatggcatgtcagtgtggcttggcggccttaggaaggcaaatgttgtttctgc 
ttccggtatactcgagtactcctacaaggatttgcagagagcaacctacaacttcaccacat 
tgattggtcaaggggcctatggtcctgtttataaggctcagatgtctactggtgagacagtt 
gctgtcaaagtgctcgcaactgattctaaacaaggagagaaagaattccaaacagaggtcat 
gttactgggaaggctacatcatagaaacctggtgaatttggttggatattgtgcagagaagg 
gtcagcatatgcttatctacgtttacatgagcagaggcagtttggcttctcacttgtacgat 
gaaaagcttgaacccttgcactgggatttgagagttcaaattgctcttgatgtggctagggg 
cttagagtatcttcatgacggggcagttcctccagttgtacaccgggatattaaatcatcca 
atattttgttggatcagtcaatgagagctagggttgctgattttgggctttcaagggaagaa 
atgatcagtaaacatgtatccaacatccgtggaacattcggatatcttgatcctgaatatat 
atcaactaggtcattcactaagaaaagcgatgtttacagctttggggtcttactgtttgaac 
ttattgctggtagaaatcctcttcaggggctcatcgagtatgttgaactagcagccatgact 
acagatggaaaaggtggatgggaggaaattgcagattcccgtcttgatgggaagtatgattt 
gcaagagcttaatgatgtagctgcacttgcatacaaatgtgtgaatcgtgcccccaagaaac 
ggccttccatgagggacattgtgcaggttctgtcaaggatacttaaatctagacccgacaga 
aagcgtcccaagcgtttttcatctgcaacagcagaggaggttaccatcaatgctgaacaacc 
agattatcggagtccaaactctggaccccgacgaggggaatctatggacagcccagctgact 
catgtgaagtttaacccagttcttccatttgtttatttttttttttttaatttcttcctctt 
cttttcttcttgtaaaattggtcaggttgttaggttctccattcataacacacttctgtctt 
ggtgcgttcgattggggtacttaggatctgttatagtctgcgtgtaagatagcctttctttc 
tttccaattttgttaaatttttgtaaatttgcgtggaaggtaaccgaatggcagaaggaaag 
ggtgaaaagcccagatcagccttttgtcaattctatgaaagttcatatatctttccacaaaa 
gtgcacgg 
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ccacgcgtccgccagcaagcacagtcgtccacaattagatattgatctgaatgtaccagatg 
aaagaacttttgatgaaataaattctcgagattctgctctagagttgatctctccattggac 
catatgactaatcgtgctgcactgaagaatgaagtaattgattctcctgctgttcgctgttc 
tggaggactcgatcttgatttaaatagagttgatgaacctggtgatgtagggcagtgctctg 
tgagtagcagtagtagattggatggtgtagttttaccttccaaaacatcatcatccattggc 
ttgccaactggggaggtgaggagggactttgatttaaataatgggcctggtgttgatgattc 
cagcgcagaacagtttttattccacgataatcatcagggaagcatgcgttcccaactgcctg 
cttctagcctcagactgaacaatccagaaatggggaatctttcttcttggtttactcctggg 
aatacttattcaactgtgacacttccatcaattttgcctgatcgtgtggagcagccgccatt 
cccaatggtcacacctggtgcacaacgaatattgggtcctcctgctgctggttctcctttca 
ccgcggatgtttaccggagttcagtattgtcatcatcgcctgccgtgccttacccatcctcc 
ccttttcagtatcctatatttcctttcggaacgagcttcccacttccttctgcaacattttc 
agttggatcagcttctttcgtagattcttcctctggtgggcggctttatacgccccctgtaa 
attcacagttgctgggtcccgttggcgctgtgtcatctcaatatccaaggccttatatggtt 
ggacttcctgacagtagcagcaacggtaccatggatcacaatagaaaatggggaaggcaggg 
tctggatcttaatgcaggccctggagtggtggacatggaagggagagaagagtcggtttctt 
tgtcggcaaggcaactctctgttgccggttcacaagcattagcagacgagcatggtagaatg 
tatgctgtacctgggggtgttctgaagaggaaggagcctgagggtgggtgggacagtgagaa 
cttcagattcaagcagtcatggcactaagatctgcaatctggtgattttataagctactgga 
ggatggacttggctaactcctcaaactctcagcttctggcatgctcctgtgggtgggcggta 
agtgagcaaatttgatgtgttcagagtctccgaccaccacctcttcagcttatcagtgtagt 
tgggatttccatggtttgcaagcactgcactttggtcagctatattctctgggtggatgcag 
atgagttttccctctgtagatatttaactgttggaaagcttgaaatctttgatgcccaggga 
ctggggataaatcaatgttatcctgtccaaattattgacaatggaggtccaatttcgagact 
gaatcaaacggaaagcttttctttgtgctttgctgttaatcatctttcaatgcttcccgtgt 
tcttggcttttctctgtcctcctttgcccattacatatgtatacagggttgacaccaaattt 
tggtactaatgctttcatcaggcatgttttagttgttgtggctgccattgtaccataaatta 
aatcgttctaacgttagtttgtagtctcattcacagatgatagaactcttgttaatgatatt 
ttcaatgatggtggggtgatgtgcttgtttttctttcaagctactaatctgaaccaacagtc 
ttgtgagcaacgaaaagacaacttctgttttctgatttggagaaattaaatgggtggagctt 

ttgcatgggttaaaaaaaaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcacaattcttctacagtacaagaaaccaaaaaaatggcgagtcttaaagtt 
ccaacatctgttccagaaccttatgaagatgctgagcaactcaaaaaagcttttgctggatg 
gggtacaaatgaggcacttattattcagattctggcacatagaaatgcagcacaacgcaagt 
taatccgagaaacttatgctgcagcttatggagaggatcttctcaaggacttggatgctgaa 
ctgacaagtgattttcagcgtgcagtgcttctgtggactttgagtcctgctgagcgcgacgc 
ctacttggttaatgaagctaccaaacgtctgacttctagcaattgggttatcttggaaattg 
cttgtacaaggtcttctgatgatctctttaaggcgaggcaggcctaccatgctcgatacaag 
aaatcacttgaagaagatgttgcttatcacacaactggggatttccgtaagcttttggttcc 
tcttttaactgcattcagatacgaaggagaagaggcgaacatgacattggcaagaaaggagg 
caaatatactacacgagaagatctctgacaaggcttacaatgatgaggagctcatccgaatt 
atttctactaggagtaaagcacagctgaatgcaacattcaaccactaccttgaccaacatgg 
cagtgaaatcaacaaggatctggaaactgattctgatgatgagtacctgaaattactcagcg 
cagcaatagaatgettgaaaaccccagagaaacactttgagaaagttcttcgattggctatc 
aagggtacaggcacagacgaatgggaccttactagagttgtcactactcgggctgaagttga 
catggaacgtatcaaagaagagtaccataagaggaacagtgttccattggaccgtgcaattg 
ctggagacacttcaggagactatgaaaggatgcttctggctttgattgggcatggagatgct 
tgaatggaatatgtgttctaagattggataagaaactatttcctaatgtctgaagtttgaat 
ttgtttgatgatgtgtggcatgtatgcccagagtttggtttgcattatatgggatttaaata 

atccaggtgttgtgttttggtttttaaaaaaaaaaaaaaaa 



FIGURE 4 (continued) 
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ccacgcgttcgggataacatcattatccttctctcctcttcttccttctttcaaccacaatt 

ctcactcccctctttcgtctctcttctccaacttcaatcccattttcaggcaaaaagctgtc 

atggcttcaatttcagcagcttctgccacagctacagcttctacaaagcttgcataccctta 

ttccccttcttcctcaagcagcagcagcaacactgctgctgtattcccttcaaattcctcaa 

agcttatcctttcctcttcttttacacccaccccttcaacccttttcctccactcaccaaca 

actactccttccaccacccacccccgtcggttcactgtccgcgctgcacgtggcaaattcga 

gcgtaaaaaacctcacgtcaacattggtacaattggccacgttgaccatggaaagaccacac 

tcacagctgctttgaccatggcgcttgcctctatgggcaactccgcccccaagaaatatgac 

gaaattgatgctgcccctgaagaaagggcgcgtggtattactatcaacactgccactgtgga 

atatgaaacggaaaacagacattatgcacacgtggactgcccggggcatgctgattatgtca 

agaacatgattactggtgctgcccaaatggatggggcaattcttgttgtgtcaggtgctgat 

ggcccaatgccacagactaaagagcatattttgttagctaagcaagttggggtccctaatat 

ggttgttttcttgaacaaacaagaccaagttgatgatgaggagttacttgagcttgttgagt 

tggaggtaagagaattattgtcaagttatgagttccctggtgatgaaattcctattatttct 

ggttctgcacttttagctttagaggctttgatggctaatcctagtattaaaaggggtgaaaa 

tcaatgggttgataagatttatcaattgatggataatgttgatgaatatatccctatcccac 

aaagacaaactgaattgcctttcttgatggctattgaggatgttttctcgattaccggtaga 

ggtactgtggcgacggggagagtagagagagggactgttaaggttggggaaattgttgatat 

agttggattgaaggatactaggaatactacagtgacaggggttgagatgtttcagaagattt 

tggatgaagcgatggcgggagataatgtgggattgttgttgagaggtattcagaagattgat 

attcagagagggatggtgttggcgaagcccggaacaattactccgcacacaaagtttgaagc 

tttggtgtatgtgttgaagaaggaagagggaggaaggcattccccgttctttgcgggttata 

ggcctcaattttacatgaggacaactgatgtgactggaaaggttactgtgattatgagtgac 

aaaggagaggaatctaagatggtcatgcctggcgatcgtgtaaacatggtggttgagcttat 

catgccggttgcatgtgagcaagggatgaggtttgctatcagggaaggaggaaagactgttg 

gagctggtgttattcagaaaatcttagaatgatgaacttgcagctgagcatctcttttcaca 

tgatcggcactttccattgaagttacttaatccattgtcatatatgcaacttcttggttact 

tttattatgtcttagaatcttactttagtagaagtatcctgttttaaacaccaaattctact 

gaacttttgggatttttcctcagtctcctctttcatttttcctttgcttgaaaggaatgaga 

acatttgatttcatgcactttatttaatttagaacaaatgtgcgactctgtttaaaattaag 
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c\\\ C ?t C ?ct\ C L\ t ca\ t ^t 3 :tt C a a 9 c a ttt , t9aCaCCtataaaatSC "««^«catt ga a 
9aaaaaaaaaatg?atgci«iaaca™ a ^T^M^ttgttgtgtaaaaga 

atgccatctcgccqcaaaaacir^ a ^H-f. ^ ^ actccaatg atttcttgtgcagaag 
ctcctgagtgaaq???tc?a?a^^ g tttCggatctagaaacttcg tttgagaatgaacaa 

a ccgt?c^?galgtcttggaca?a^ 
-^tgagaa^tt^gttt^a^ 

tccatggattggatacacaaanr?t^ = ^ g 9 agttcag 5g tc tttaacc 
ctttggagctStLt^gcgaag^ 

tctgaaattcgcaaggLa^ctc?c^^ 9gg tCaaccaca 9 a ttgtc g gatgctccact 
cagaaaaatca'agta^gc^^^g^™ 

ggtttgccaagaccgaagaaaqc^acacatra™ 9 If tcaaccaa SS a tccgtggaa 
agttggaagaattaLgLaggtig^cac^at? 339 ^ 

aaa tggaa g ctagctt?a g a g Lggatc a qc?caaSL^? C tgatgatgata t aaa tagc 
acttgctagtttattaaaafr^ 39 ™ ; 9 agttggggatccatccatc caaca 

actagtgt? g ta g ata^ 
tgta g tLc?a a a a c a ctftat g "^ 

gcatcttactgacgcaacaattataactatgcttcagtgtatcaag 



SEQIDN058 



atttcaggcgagcgaggcag^1^ t Caa ^^^^^:"«=9"?g«f-aagggga 

ggt S tg g ct 9 a 9 a 9 cg a c^g 9 t aa £ ^Sffi^t'^StlTr 9 ^ 9,2 " 
aaagcctttattaattactottorttni-ao*. y tgtta aaatactta tgtgtgcaaaagta 



FIGURE 4 (continued) 
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ccacgcgtccgctcgcgttttagcagccactggagaaatcaaataggagagagaaggtagtt 
tctagagagagaaaccaaacaaaacaacaccagtttttagagagagaaaaaaagtaaaccgg 
actattctcgaagaaaattttccggtgactgtataaattattttcaagtgaagtttccatat 
ttgtacactcattgtcaattgattgcgttgccgtctccagattctccattaccgatttggta 
attaggttttcgagatcggttggttatcatccttcgattcgttaattcgggttaacaggaat 
tttttggtttcgattcgataatcgggttcaagtatttcagaaagagaacagaaaaaggaaaa 
aggaggtcttaaatctgtttggaagtgaagggggttttggttgaaagatgttgaccaccatg 
gttggtttgtgatgtaatatggcacgggttgttacagataaagatatgtcgttttacattgg 
tcgcgaggcttcaaagttgtggaagagattttgtgcggagataacaacagagatcaatcttc 
ttgctgagaattggaaatatattcttggcggtttgatttgtcagtacatccatggacttgcg 
gctagaggggtgcattactttcatcggcctggaccaattcttcaggacgtcggcttctatct 
tcttccggagcttggacaagatagagcttacataagtgaaactttatttaccaccatctttc 
tatcttttgtcttgtggaccttccatccttttatttttaagaccaaaaagatctatacagtt 
ctgatatggtgcagggtcctggcattcttagtcggttgtcaattccttcggatcataacatt 
ctattctacgcagcttcctggtccaaattatcactgtcgtgagggttcaaagcttgccacgc 
ttcctcctcctgacaatattttagaagtgctattgattgttcctcggggcgtgctttatggt 
tgtggtgatctcatattttcttctcatatgatattctctcttgtctttgtgcggacatacca 
gaaatatggaacacgaaggttcataaaacagtgtgcttggttagctgttattgcacaaagct 
tattgattcttgcatcgcgcaaacattacactgttgatgttgtggtggcatggtacacagtc 
aaccttgtagtgttcttcattgataaaacgttgccagaacttcctgatcgcactagtgcctt 
gttgcttccagtgaccaaggatagcaagtccaaagaagagaatcacaaactgctgaacggaa 
attctggagatcctgcagaatggaggcctcgaaacgggaagatcgtggaagatgggaaaaca 
gtgcacgttgaagcagtaattaatggtgcatagacgataaacttcatgcaacaccactaact 
gatgcttgcgaccttggtacagagattggtaacaatgccattataagttgtgttaatataaa 
tcgttctgggtgttcttccaagttcaatagttttggttttagcgtaggatacgaaatcaaag 
attgagatgctatcgatgtctccacggtcctctgattttatcaaatgtatcatggaatttat 

tttattttttggttaatgcaatatttcccatccg 
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ccacgcgtccgatagatatagatacagaagagagagagagagaggtggtgg'gtgtgaatatg 

gtatagggcctgagacccctgaaagggcatggaagtgctggccatgctgaggcaactcattg 

gacaagttaaacaactcttacaacaacaaaatacacactctcct'tcttcttcctcctcctct 

tcttcttcttctaacttctcttttcctcttcaatcgccaccgcttttacacctcccaaggtg 

ttatgttctgaatcttgatgacagttctgctgaagacagttgctacaatatcattatgactg 

ctggaaaatctgaaaatctcaagatgttggaacctggcaagcctccaccaaaaaagaaagct 

cggaaggagaggaatcgaggaaaagtgactggaacttcatgctccatagagaatttggatcc 

gcaaatatggaaagaatttcctgaagacttatttgaagctgttattgcaagactaccaattg 

ccacttttttccgcttcagatctgtctgccgcaaatggaactcaatgctgatgtcccaaagt 

ttttctgaacagtgcacccaagttcctcaaccacaaccgtggttctacaccattactcatga 

aaacgtgaatactggagccatgtacgaccctatgttgaagaaatggcaccatcctactatac 

ctgcactgccgaccaagttgatagtcttgccagttgcttctgcaggaggtcttgtctgtttc 

cttgatattggacataggagcttctacgtatgcaaccctcttactaggtcctttaaagagtt 

accagccagatctgttaaggtgtggtctcgtgtggcagtagggatgacattgagtgggaaat 

cagcttacagtatcctttgggttggttgtgatggtgaatttgaagtttacgactccagaaag 

aactcttggactcgtccaggatctatgtcctcaaatgttaagcttcctatggcactcaactt 

caagtcgcagacagtcaccatcggtaataaattttactttatgcgctcagagcctgatggaa 

tcgtgtcctatgacatggttactgggatctggaagcagttcattatccctgcacccctacat 

ctgagtgatcatacactagcagaatgtgggggccgcataatgcttgtcgggctgctgacaaa 

gaatgcagccacttgcgtgtgcatatgggaactgcaaaagatgactcttttgtggaaggagg 

ttgacagaatgccaaatatatggtgcttggagttttatggaaagcacgttcggatgacttgc 

ttgggtaacaaaggtttgctcatgctatctttaagatcaagacaaatgaaccggctagtaac 

gtatgatttctcaaccggagaatggatgaaggtccccggttgcgtgttgccccgtgggagaa 

agaggcaatggatcgcgtgcgggactgcttttcacccccgtcttacagctttggcttaactt 

gggatgcccagtaaatttctagtcacagcagagtgcgatttattatatcatgtggttttagc 

ttttcccatcataatctgcagcctagtgttctctttgctgaatttattaccactctcttgta 

taaacatctagttgttaagcttttcattccagaggactaatctacgactacttattattaca 
ttaaaaaaaaaaaaaaagggcggcc 



• 
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tcttLcatttccacctccaaagccatccttccagaaaaccttgttcct^ 
cccaaaaaagattcccatctcaattccacaaaaaaacacaaggagatctaaggaa^ 
gcctctataLtagagaggtggaattgttcctgaatttggtttgaattgattgattgaca^ 
ttttggtgagagggtgttattgaaaaaatgggtgacatgaaggataaagtcaaagggttcar 

gaaaaaag^L^tcttct^ 

catcttcttcaggaccctcaaatcatgtcaataatttttcatcac^c 
caagatcaacaaccttcatatacaaaaacttcgcctcaaaaaccaagtaattctgatc g 

aattgagaatatatgtgaaattcagttcaacaaaagtgaa J^f^?^^ 



ttggtgaattagtcacttctgggaagas, — - ",,^^-raartcatattqatag 

gaatgccctgtctgtggtagtggttttgtttctgaagaagaggtgtcaactcatattg^ 

ctgt?ta a g?tctgaagtgtcttctaatttgggagttgaaa^ 
aa?tggaaacatgtgttagtgcatatgtttcagggaagccctcagaagg^ 

gtcattaagttgttaaagaatattgtgaaggaaccagag^ 

gatggggaatccaaaaataaaaggtgctataggtgatgttgtagga^ 

aatttgttggatttgagttgaaagaagaaggtggggaaatttgggctgrg ^ * 

tctgaagaacaacttgttatgcttaagaatgtagtttcactcttggaaccgaagaa^ 

agagttggcgtccttatcccaagttaaggcgagtgaaccagttgagccgaagaagattga 

cacaaa^Lgagtgttcttttctgttcccgagagcgtagcagcaaaaattgagctacctgat 

?cc^ttaaLLtcacgtgaggaattgagaagagaag^ 

agaagattccaaattattgattcctaaatcttatcgggaaaa^ 

tccattatccgtgtacagtttccagatggagcattgcttcaaggT-9 



aagcttagagt?cgaattgttacat^ 

cagctgctggggagagggctgtaacagttgaagaggaggatt^ 

aaatttaaacctatcgaaacagattctgttgtttttactggtctrtg y y 
ttagcgagcccctcgagaccggatcagttgcttcctcgtaagctctaaattac 

gaat?c?tctgag?gttggaaaccttataaaactctctgcgccgggaatgct 



aa 
ttt 



ccacfcgfccggactttct^^ 

tatgattatcttatcaagctcctcctcattggcgatagcggtgt^ 
tctgaggttctcagatggttccttcacaacaagtttcatcaccactat^ 

agataagaacaattgaacttgatggcaagcggattaagttacaa^ 

caggagcgtttccgcactatcacgacagcgtattatcgaggagccatggy 

gtacgatgtcacggacgagtcatctttcaataacatcaggaactggatt^ 

agca'tgcttctgacaatgtcaataagattttggttgggaacaaggctga^ 

aaaagggctgtgccaacttccaagggtcaagctcttgctgatgaatatgg * ta 

tgaaacaagtgcaaagacaaacatgaatgtggaagaagttttcttttcaattgc^gy^^^ 



tcaaacaaaggctttcggaatctgattccaagacrgag^uoa^™.- t 
tcggatcaggccggaacttctggtcaagctgcacagaa^ 

ggagacaatcgtgtgggaagaacgttcgttagtugcatttggarg agctt g 

gLLactgattcctgttaacttcattaccaaatatttcttcgccatctga^ 

atgtgtcaaaggcttttctactgtcgttgtgaatctattgtcatgcagttaa 

ttttgataaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgaataaatctgcttttggaaacattgtgttgcatccttctcttcagaggaga 
atagaacacctcgctagggccacagcaaacaccaagtctcaccaggcaccatttcgcaatat 
gctcttttatggtcctcctggcactgggaaaacaatggttgctagggagatcgcaagaaaat 
cgggtttggactatgccatgatgactggaggggatgttgcacccctgggtgcacaggctgtc 
accaaaattcacgagatattcgattgggccaaaaaatcaaataaaggcctactgcttttcat 
tgatgaggctgatgcatttttgtgcgagcggaatagtacatacatgagtgaagctcagcgaa 
gtgctttaaatgctttactctttcgaacaggggaccagtcccgagacgtagttcttgtcctt 
gcgaccaacaggccaggagatctagacagtgctgtcactgaccgtatagacgaagttatcga 
attccctctccctcaagaagaagagcgtttcaaattgctgaagctctatttgaacaagtacc 
ttgctggtgaaggagacagtgacagcaattctaagtgggggcacctcttcaagaagaaccaa 
caaaagaggataaccatacaagatttgtctgatgatgtgattagagaggctgctaagaagat 
agaaggattctctggccgtgagattgcaaaacttatggcaagtgttcaagcaactgtatatg 
ggagcccagattgtgttcttgattctcaactgttcaaggaaatcgtagattacaaggtcgct 
gagcatcaccaacgaataaaactagctgctgaaggtatggagccaacttaccaggggaatta 
actgacaccacaaagatacaagtgtctttcactgatacgaattgttgaaaatttgtttatta 
tctctttggtagtattgcatgcaaaattcattttttccaaacttaggatattgtagtttagg 
tgtactatttctgcttggggaatgagcactggatggtggacgtgtttcagggttcaatggga 
cgttacaatttgatgggtacatagctcacttgggctgtaattgtattgattctgtggatcgc 
aggaaaatacatccattgaatagataaatagtaggcaaaacatgaagtctctttgaaatagg 
tctctgttatcaaatatcaactaacctatcttttgattacc 

SEQIDN064 

ccacgcgtccgtatcttaatccgactccatctcctatctatctctcatacacttaacataaa 
tccacaatcaaattccccactataacacacacccaaattataaagagagaaatttttcgttc 
tgtggtgtttattattgtttgtgggttttgtaaataaatggggtcagaatcagatgagaggg 
aggtgatattgggtgtagatgggggcaccacctccactgtgtgtgtttgtatgccacttctt 
ctcttttccgaattccctgatcctcttccagttctgggccgctccgttgctggttgttccaa 
ttttaatagcgttggagaagatgtagctagagaaacactggaaaaggttatggcagaagcat 
tgcttgatgctggtgtgaaacgatcagctgttaaagcagtgtgtttgggtctatccggtgtg 
aaccatccaacggatcaggagaaaatattaggctggttgaggagtgcattcccaagtcatgt 
taagttgtatattcagaatgatgccgtggctgctctagcaagtggcacgatgggaaaacttc 
atggctgtgttttaatagctggtacaggaagcatttcttatggatt'tactgacgatggaaga 
gaagctcgggccgcgggtgcagggcctgttttgggtgattgggggagtggctatgggattgc 
tgctcaagcattgattgcagtgatgagggctcatgatggtcgaggtccacaaacaatgcttt 
c gagttgtattctacagtcactaggtctttcttctccggacgaactaatagggtggacctat 
gcggatccatcttgggctcgcattgcagcacttgttccagtagttgtatcctgtgcagagga 
tggagatcaacttgcagacgagatcttacataatgcagttcaagaattggctataagtgtca 
aagctgttgtccaaagactacgcttggccggggaagatggaaaaggttccttccctgttgtt 
atggttggaggcgtacttggagccaacaataaatggaatatagggaatgaagtcactaattc 
tattttaaagacttatcctggagcttgtgtaattagaccaaaggtagagcctgcagttggag 
ctgctttattggccttaaatttcttgatgaaagaaacagtagctaatggccatagttgacac 
ctgattgtacatagctaactgtgttaactgtataatcattgaagttctctttaatcggtggt 
tccaattctgggagggcatgtccttggatcatggtactgtacttgccttctctttccattgc 
atatgcagactgctaaaaatgatctgttattcaaatgaacgttgcaccaacttgttgtaaca 
tatctttgtttcctaagttgggcagtcttttggtgctggaggagagggaaggagattgtttg 
gtcatagttgcatttgtattgctgatggttatatagaattcataactgatcagtatgttatg 
taatctcttttatagcattctctgttgggataaaaaaaaaaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcaaaatttagaaccccaaaacaatcagtactcctcactcccaatttggccc 
caatttgaattcaaaatctggaagcattcatgtgactttttcattttttcaaaaactctctc 
tcttctctccctccactctctgtgaaaccctagacacacactccatacgctctcgcaacctc 
tacctctctcttaaatcagcaaacgacagcgatctcatgacggcggtgccgcagtctgccgg 
aagagagctcgcaagcccacccaaggacggcatatccaacctccgattctccaatcacagtg 
atcaccttcttgtttcgtcttgggataagacggttcgtttgtacgatgcaagtgcgaatgcg 
ttgagaggagagttcatgcacggaggtccagttctcgattgttgcttccacgatgattcttc 
tggattcagtgct'agtgctgataataccgttagaaggcttgtgttcaactatggaagagagg 
atatcttgggaaggcatgacgcaccagttcgctgcattgaatactcatatgcaaccggacaa 
gtgataactggcagctgggacaaaaccttgaaatgctgggatcccagaggtgcaagtggaca 
ggaacgtactcttgttggaacgtatacacaaccagagcgtgtttactctctttcccttgttg 
ggaaccgtttagtagtagcaactgctggaagacatgtgaatgtctatgacttgcggaacatg 
tctcaacctgaacaacggagggaatcttccttgaaatatcaaactagatgtgtgcgatgtta 
tcccaacggaacaggctatgctctaagttctgttgaaggtcgggttgccatggaattttttg 
atctctctgaggccggtcagtccaagaaatatgcatttaaatgtcaccggaaaactgaagct 
ggaagggacatagtctaccctgtaaatgcaattgcgtttcaccctatctatggtactttcgc 
cactgggggttgtgatggttatgttaatgtctgggatggtaataacaaaaagaggctatatc 
agtaccctaaatatccttcaagcattgcagcattgtcatttagcagagatggtagactcctg 
gctgtagcatcaagttatacatttgaagagggagaaaagccccatgagccagatgccataat 
tgtccgcagcgtaaatgaagttgaagtgaagccaaagccaaaggttttgccgaatcctacct 
catgaaaactatttcagaagctcctcgatcctctcgagtcgactagtttatcttactttgga 
aaacaaaaaaactcttatgtacttaatatttcaatttgacttccaggactcatttctcgtag 
ctggaaattctggagaacagtgataaatttgtaattatccagttagcaattgtacctttttc 

gatgaaaaaaaaaaaaaaag 
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gctggcactgtcaactLtaaSr!; ? 9 tttggCttgtttctcctt gctccgtcg 

^taaatcggV^^^^ 

^ttacgagtgggatrtttLgmtSStSSSa^ J ? CCagcatt S a tggatc 
caa tatggcagatggctgtg g a!ccat?ca^^; 9g Ctgtgctagatt ccattggtgttt 
cagtatgagaatgg^atatS^^r f tC9CagCttcatcaaaatct tccaagg 

aggtgaagatgat'gatgJctcgg^ 
ta ^aaagatca?tgccLgggtcag?aaaca?L 

aaa g^gatattttctggt?g?aq?aa?aaa g ^r tgagcgtca ctt ggag ttctgatgc 

atgaagtctataggataSSt'gac^aaa^ 
tggtcattgcttgcgttgagatg?gaticca?o^r 9 ?^ 

gttctgggacacccatc2tggJactS?tacaat^ 

at g ctttagca g catcacccLcca?aaa^^ g ^r! t ! Ca ! gtCataaa ^ t ^cgtga 



cagttgctgtacccattactcatas^rr^^™ ^ . tgatgt ^ a 9gg<=cttgg 
tccggggagaagcccct?^^ 

tatctcag g t gg t g atgaLctaaa?tt a ??aca a ;L gg rr 
- tccg -tgacatttgtccttcacct^ 

ttcagtcaggcttctttactcttan*-^^ ? Ctatacaactt 9 ca gtaaatacaatt 

tagtg g ctc g t g ttaa g tgcaaLct?ca?ia g r f ggC9gggCtgCgagaacagatc 
ggtgtaat g tt?gcttat g ccaacS^a ga ?J a 5 a ! Cacatgcagt 9caatttctccttca 

tgctagcaagag?ccatggac g g?cajccaaa a ^ 
attcaatggttttcagtgLgaSc??^^ 

tatgtggttgatgctgtLgcttggaicJa^cata?^ 

ttacgaagaattgctaccaaatqaaccJc^?! 9 ttttCacacctc 9 tc gtcaagagca 

agtggttagctgctgtcaactgctttggaaatg^^ 

caacattg g tttatatcaaga?tgaa g gj? a c g ^ a t a ^^ 

aaata g caat g tgcttata g tatccac? 9 rtt^ 9 gCgggtggttttactcctc g 

ctaagcaactaggagaatggJccaaccgg^ 

"tcctggagaagtgattgggctt^???^ 

ctacagttcaagggcgatgtqcttoa^toa^ f tgct aattcatcatgtgtgattgt 

accttaaagcgcaagccgataaaaaataari-t^ff ? ctacata 9 ta ctcctgcgaatggg 
t g aattctgt g cat 9 ca ggg ag cc aa t ga ^ 

tgatcatagaLaaccc g gg attcaaataa g t^ 
cgtattttt ggg acataaa g t g cttta tcac g Jtt aa t aa S^ 

gaggggt g tattcaacccctttcactc a t a ?^;:;? 9 tttactaggaaacgttcc 

c ctatggaaattgcgtagacctqta aag a^ a ^ t agCttcacttcat ^gcaggagttct 
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ccacgcgtccgcctctgagtacccttgagaagtcagagatcgatcttattcgctgccgggag 
atctgattacttcacttgttttgttcttttaggaaagatatcggatctgaccgtcaaacaaa 
gtaaaagatgcaggatcaggaagggcatgtggctgatgcaggaaaagaaacattgacatctg 
ttcaaacatctgaaattgaagattggacaaaatacaaggatgatgatattatgcaacagcaa 
tcttccatccaggctgaacaagctgtaaaaactcaatttgttggcgataaggaacctttgtc 
ttcattagaagctgaataccatctgggaaattcaattttgctggagaaaataaaggtgctga 
gtgaacaatatgctgtccttagaagaacacgtggagatggaaattgctttttccgcagtttc 
atgtttggttaccttgagcacattctggaatcacaagatcaaagcgaagttcatcgcattaa 
agctagtattgaggaatgcaaaaagacacttcaaagtttgggctacgcagaattcacatttg 
aagacttttttgcgttattcctcgagcaactcgataatgttcttcaaggtagcaaagattcc 
ataagtcatgaagaactcctacgcagaagtcgtgatccgtccatttctgactatgttgtgat 
gttcttcagatttgtaacatctggtgaaataaggaagcgctcggagtttttcgaaccattta 
tactaggactaacaaatgcctcagtggagcagttttgcaagtcatcagtggaacccatgggc 
gaagagagtgatcatgtgcagattatagccctatcagatgcgttgggtgtaccaatccgtgt 
cgtatatcttgatagaagctcatgtgagaacaacagcatcaatgtaaatcaccacgactttg 
ttcctacaagcgatggcatggggaatagtggtgtttccaagaccacaaatccatctattacc 
ttgctgtatcgcccaggacattacgacattctctaccccaagtgatgttcttcatttagggg 
tcgtttggtttgaatacagtttatgtcgggataagttatactggtataagttatgctgggat 
tagttatgctaggattgttttttatccattgtttggtatgttgtattaaatatgacaattgc 
ataatctgtaagaagattgtataccggtgctaattaccccaccctcgataaggtataagtta 
tcccggtgttaattttaatcctgggataacttatacgtggtttgctaaccaaacgaagtatt 
aaggtggcat 

SEQIDN068 

ccacgcgtccggaagaaacgaagccggagaagagggctcttcttttcgtggagaagaacaat 
tataggagtattatcttatacttattcttaccaaagatggatcggtaccaaaaagtggagaa 
gccaagggcaggaacacccattgatgagaatgagattcggattactagtcagggtcgcatgc 
gcagctatatcacctatgctatgaccttgcttcaggaaaaaggatcagatgagattgtgttc 
aaggcaatgggcagggcaatcaacaagacagtgaccattgtggaattgattaagaggaggat 
tgttggtcttcaccaaataacgtctattacatccactgatattactgatacatgggaacccc 
ttgaagaaggccttctacctctcgaaaccaccaggcatgtctcaatgatcacaattaccctc 
tcaaaaaaggagctggatttgacttctgtggggtaccaaccaccattgccagcagaccaggt 
gaaagtgttgacagattttgactatgatggaggatcacctagtggtggacgaagaggccgcg 
gtggtagaggaaggggaaggtctagaggtttctcaggaaatggctttatgttggctgaatac 
gatgatggcgggtttgatcgcaatcggagctatggtaggggtaggggtcgaggcagaggtcg 
tagcttccgtggccgtggaaggggagggtacaatggtcctcaggatgcccagcaagacgctg 
acttctacaatcaagaagcacccatgcagggccgaggccgcggacggggaaggggaactcgt 
ggtaggggacgcggtttcagaactaatgggccgatccatggcggtggtgcttaaagatcaaa 
ctttgaagaatacagagattatgtgctatgagtgcctgctccatgttctatgttttttttcc 
cttcagttgttacccgtgttaacagtaggttattgatctgtaatcagagtagactaattata 
gatttcattaccgcccgtatgtggtgagtttttttgttttttttcttgatatcttctagtat 
tttctttctggtagattaggtgcttgatcaagtgtaatttccttagtgagcagcacattctt 
taatttgtctgtgttagacatgttcagtgttgacctcagtgcgtaaatttgcctctgttttt 
agttggcagaatactcaaattacataatttctgctgcgttttatacttctttaactattgaa 
agtctttgcttttacaaaaaaaaaaaaaaag 
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ccacgcgtccgaacagaagctgatcttgtatatatcgtagacgatgacatgattccagggaa 
gaaaatgttgcaaattttagcacatgtagcagggatagacaagtacaagaattctgttttgg 
gaagcattggtaggattttgccatttagacagaaggattttacttttccgagctataggaag 
tttcgatcaaaggaagcagggctttatttgcctgatcctgctta'taatatcactgttgacag 
aattgttcaggtggatttcctctccagttcttggtttctttcttctgaactagtcaagacac 
ttttcatcgaaacgcccttcactttcatgacaggagaagacttgcacttaagctatcagctt 
cagaagtatagaaatgctggatcatttgtgctgccagttgatccaaaggacaaagaaacttg 
gggtgacagtgagcacagacttgcttatgtatccgaaaccactgttatattcaaggacactg 
ttcaagtccgagacaatcaatggtggaaagcactctccactggttatgtaacacaatgggca 
gdaatgaatcctcagaaaattgatgcacttttctatgcccactctgtcgatgaagttaaagc 
tctcgcgcctcttcttgagaaattcaggtcaactgttggaaagaaggcctacattgttgtct 
caggaggcagcttctgcccgtgcgaagatgctgttacagctttgaactggcctaaggttgta 
tgcaaagaaagaagattcaagattatggatttaggagttggtgctctatcaagtatttcaaa 
ttcagaagtgcccgtcgttcaagcagtctatgctagcatgaaaggactaatcaacattcata 
acccgattcttgtgatcacggtagctgatgcagatcctcatgtgaagaaagcactcaagatg 
gctatagaagctaacaccaacagttcatctttagtccttttacctagatcatcggtcactaa 
gcttctttggatggctgatcttcggtccacagcattgccaaattggaatcgtatgaggcttt 
ccataaatatcatcacacagaatagagctaattcactagcaaggcttctcaaggctctcagc 
gacgcatactatataggcgatgaagttcctattactttcaacatggatagcaaagtggatga 
agcaactataaagcttgttaactcattcaattggcctcacggacctaaaagtcttcgaagaa 
gaatcatccaaggaggtctaattcgagctgttagtgagagttggtacccttcatcggatgat 
gattttggcctattactcgaagatgatatcgaagtctccccttactattacctctggatcaa 
atatgctgtcttggcctaccactatgaccctcaaatatcacttcctgaactctcatcgatct 
ctctttacacgccacggttggtggaagtggtaaaagaaaggcctaaatggaatgcaacagat 
ttcttcaagcaaattcatccaaacacaccttatctccaccaattgccttgtagttggggtgc 
agttttctttcccaagcaatggaaggaattctatgtttacatgaacgtgaggttcactgaag 
atccaaagcaaaatcctgttcagataccaaaatcaagaacaaatggttggcaagcttcttgg 
aaaaagtttttgatagatatgatgtacttaagagggtacgttagcctttatccgaactttcc 
aaatcaaacgagcttttcaacaaatcatatggaaccaggtgcacatattgctgctaaagaga 
atgtggttaagcataacaaggctgattttgaagtgccattgttaaaggaagatttcaagaac 
cttttgccaaatggaaaaatgcctccggtaacaaagttgccttcattgaacctcttcaatca 
gcctgtttctctaaagggattaaaagcagcaggagcaaaactagggaaagatgttattcaat 
gcagtccaacggagatagtagccgttcaccacgacacaggtttaccttcacattgtgcaaga 
ttctgaaaactccatactcgtccgatgatcacaaattaattcttttgttttctctcccaaat 
ttgccatgttacattacttggtggaaatgacagttaggaattggtgggagagaaagatgagg 
gtttgattcagctttatttctcatgcaagtaaggggaataaggattctttatgaatgactac 
tgatgagaatgtactcttgtaatattgcagccaaaattggctttctgtatcatcttcttttg 
cctcattttgcaatcaatgaaagtagacacatca 



FIGURE 4 (continued) 
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ccacgcgtccgtggtggcaaatcattctcttcttcgcggggttcttcatcttcgtcgagggg 
gtattctacgaggagagcagatcctagtttttcgtattcagttccctattatgcgccttctc 
cttttgggtttggtgggggtggtggggtttatgttggcccagctgttggttttgggtttggg 
tccagtgcctttcttatcatgatgggttttgctgcttttgttttggtttctgggtttctctc 
tgatcggtctgaagggggcagtgtgcttactgctactgacaaaactagtgttctcaagcttc 
aggttgggttgttgggcttgggtagatcactccaaaaggatctcaaccggattgcagaagta 
gcagatacatccacatcagagggtttaagctatgtgttgacagagacaacattagcattgct 
tcgacaccctgattattgcatctcagcttattcatctgttgatgtcaagaggagcatggaag 
aaggggagaatcgattcaatcaactttccattgaggagcgtggtaaatttgatgaagagaca 
cttgtgaatgtgaacaacattaaaaggaaaagttctacgagccagagggcaaatggatttag 
caatgaatacatagtggttacaatcttggtagctgctgaaggcgtttataaattgcctacta 
ttaatggaagtggagaattgaaagaagctttgcaaaagattgcatctattccttccagtaga 
acactagcagttgagattttatggaccccacagaacgaaaatgacacgttatcagaacgaga 
actccttgaggattaccctctcttgcggcctctgtaagaaaactgggatttcatgcttttct 
tttactttctaaagatcatataggctgctctcaaccactttttgttatcttcatgtatatag 
ctcgtagagcatcgataatacttgtgtaagaatgagaccaaattttcctaattgtactagta 
aaattgttatataaaatgaccagattctccttaaaaaaaaaaaaaaaaaaaaa 

SEQIDN071 

tgtacaaaaagcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgc 
ccacgcgtgcgcaaattcgcggtgatgaagaaaatggttactcacaaagctatcaaacagta 
caaagaggacgttttgaaccctaataagaaagatttgactaaagaaaagctccccaaaaacg 
tgccttacgtttcgtctgcgcttttcttcaagtacaacacagctctgggaccgccttatcga 
gttctggtcgatactaactttatcaatttctccattcagaataaattggatttggagaaagg 
aatgatggattgtttgtatgccaaatgtactccgtgtataacagactgtgttatggctgagc 
tggacaagctgggtcagaagtaccgtgttgctcttagaattgcaaaagatccccgatttgaa 
aggcttccctgcactcacaaaggaacatatgctgatgattgtattgtcgagagagttactca 
acacaagtgctatattgtcgcaacatgtgatcgagatttgaagcgtagaatacgcaaggtcc 
ctggtgtaccaatcatgtacattactcaacataaatactccattgaaaggttgcctgaagca 
acaatcggtggagctccaagatattgagtacgtgtttcgagcagtcaaacaatggaatttcc 
aagaccttggatagtggttcgaattcccatcacggctgtcgctgcatagattaccagatctc 
ggtgcgttgtgcaacgaaaaatgctgaagtatcagtcgaatctcaattttgtacccggtgga 
ttgttatgtgttcctcaatgataaagaaatatgttcgattttgtttagttagtatctctagg 
tgctgcccccgtgtgtcttaattaaacagccaatagcggtgtcctaaggcattccaaacaga 

actataatccatgcctcctttaatgtgtaagggggtggttatcaac 
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SEQIDN07 2 

gtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccg 
cccacgcgtccgcaaattcgcggtgatgaagaaaatggttactcacaaagctatcaaacagt 
acaaagaggacgttttgaaccctaataagaaagatttgactaaagaaaagctccccaaaaac 
gtgccttacgtttcgtctgcgcttttcttcaagtacaacacagctctgggaccgccttatcg 
agttctggtcgatactaactttatcaatttctccattcagaataaattggatttggagaaag 
gaatgatggattgtttgtatgccaaatgtactccgtgtataacagactgtgttatggctgag 
ctggagaagctgggtcagaagtaccgtgttgctcttagaattgcaaaagatccccgatttga 
aaggcttccctgcactcacaaaggaacatatgctgatgattgtattgtcgagagagttactc 
aacacaagtgctatattgtcgcaacatgtgatcgagatttgaagcgtagaatacgcaaggtc 
cctggtgtaccaatcatgtacattactcaacataaatactccattgaaaggttgcctgaagc 
aacaatcggtggagctccaagatattgagtacgtgtttcgagcagtcaaacaatggaatttc 
caagaccttggatagtggttcgaattcccatcacggctgtcgctgcatagattaccagatct 
cggtgcgttgtgcaacgaaaaatgctgaagtatcagtcgaatctcaattttgtacccggtgg 
attgttatgtgttcctcaatgataaagaaatatgttcgattttgtttagttagtatctctag 
gtgctgcccccgtgtgtcttaattaaacagccaatagcggtgtcctaaggcattccaaacag 
aactataatccatgcctcctttaatgtgtaagggggtgttatcaaccttgt 

SEQIDN07 3 

ccacgcgtccgcgaggcaacagatgaagcaggtgtgttgttaactatgagcacgttgactga 
agatggcgtgatttcggtgaagaatgcagcttgtgagaggttactgaatcagagggtggaat 
tgaaaatgaagtcgaaaaagttgaatgactgcttgaaccgcttccatgttgctatgccaaaa 
ccacgtgaccagaaagagaggccagcatgcatacctcaggcagtgttggaagccagagctaa 
ggaggctgaggcagatgctgagaaacagaaaaggaaacttgagagagatctggagaatgaga 
acgggggtgcaggtgtttactctgccagcttgaggaagcactatctattagcaaaagaagag 
tggaaggaagatgtaatgccagaaattttagatgggcacaatgtctacgactttattgaccc 
tgatatcttacaaaggcttgaagaattggagagagaagaaggtcttcgtcaggatgaagaag 
gagatgatgattttgagatggacggcgttgagctgacccctgaagaacaagcagcattagct 
gaaattcggaaacagaagagtttgctcattcaacagcatagaattaagaaaagcaccgcaga 
gagccgacccactgtaccaagaaagtttgacaaagacaaggagttcacttcaaaaagaatgg 
gaaggcagttatctgctttggggctggatccaactctagcaatcaatcgagcccgtagtaga 
tcaaggggtcgtaagcgagagagatcagttgaacgtggagatgacattggtaaggatgcaat 
ggatgtcgacaagattactcccaacaagaagcaaagattgaggtcactttccattacggcaa 
gatcaaggtcaaggtcacgacctccagatgaatttgttccaggggagggcttaaaggacaaa 
gcccaaaagaagatggctataaagatggctaagggttcttctaagaagaggaataaggatgc 
tcggcggggagaggctgatagagttattcctactctgaaaccaaaacatctcttctcaggaa 
agcgatcaactgggaaaactgaccggcgctagtaaaccaagatggcattttatcttggaatt 
tgctgatggtacctgtcaagatgcttgtgttgcaatatcttgggtggcggacagaaaggcta 
aaagaaaactcagcttgtgaggaagatgtcaagaattcaatctattgaaatggcaagaccaa 
gactacagattaagtatttaagtttgtgcttaagatgcagctgaacttgctgcctctattat 
gcatttttggaacttagatacctgttgtaagattgtgtttatcccgatgttaaattttgtct 
cagatttttttgattttctttagtacagcctttcctctcttttttgcatcaactttctgttt 
acacgccctaaaaggcgtattcagaaaatgtattcatctgccaatctccttgggatgttttt 

tttttttgggaa 
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SEQIDN07 4 

ccacgcgtccgattgttaattact get tctgtccccacaccacttaagagcacct cat teat 
ggcttctcccaactcactcaccactttctgcattatccagtgctcattttgctgtaaactac 
agttatttcttagctgaaaatccaagatttgctgttaattcttgacccttttgccccttctt 
ggattttctgttatttttggattcttttttgtgtcttgaagcaaaggaaggcagaaaatgag 
aggaggggtcagtggaagtttcaaacttgagcttctggttgtatttatactgcttctttgta 
tccgagactccaattgcagctcactgaagcatctaaaaggctctctattcaaggacataaag 
gaggacactcttttgccagagatctccccaaatgctgctccacagccccttcttcccctatt 
tgcaccttctccattggcacctttcacaaacagcactttacccaaattatctggactctgta 
cgcttaactttgatgctgtgagaagtatgatgaccgtgacatcaatagattgtgtagcacca 
tttgcacagtatctggctaatgtcatgtgctgccctcaactggaaacaactcttgttattct 
tattgggcggtctagtaaaaaaacaaatatgcttgcattaaatgggaccctcgcaaagcatt 
gcctttcagattttcagcaacttctggtgagccaaggtgccaatgatactttgcagcatata 
tgctctctccatccgtctaatcttactcaaggttcttgcccggtcaaagatgttcatgagtt 
tgagacgactgtggactcgtctagcctacttgctgcctgtggcaagatcgatcttgtgaatg 
aatgctgtgagcaaacctgccaaaatgctatatcagaagctgctaaaaaacttgcacttaaa 
gcatatgatcttttaagcatggatggctctcatgtgctggctgatcacacgaccagagttaa 
cgactgtaaaagtattgtacaccgatggttggcaagtaaacttgaccctgctggagcaaaag 
atgttcttagaggactttctaattgcaaaaacaataaagtgtgccctctggcttttcctggc 
atgaaaaatattacaaaggcttgtggagacgggatgaataaccaatcaatatgctgtaatac 
tgttgagaggtatgtctctcacttacaaaggcagagcttcgtcaccaacttgcaagctttgg 
attgtgctgcttcacttggtcttaagctacagaaagccaatgttagcaaaaatgtctacaat 
ctctgtcacattagcctcaaggatttttccgtacaagttgcaccagaagtttcggggtgtct 
tttgcctagtttaccgtcggatgcaatactggaccaaagtacggggatcagttttgtctgcg 
acttaaatgacaatattccggctccttggccatctatgtctcagttaccagcttcgtcatgc 
aataagtctgtgagaattcccgcacttcctgctgcagcatcgggccaaatcagtaaaggatt 
aaatatatggtcacatatgctactgatggcgtcgatgatattgggaatctgctgtatatcta 
atgctgccaatcttgcttattagctgtattttgtggaagcacattttgaccagaaagaaaat 
tcaaaaattacagttctatgaaggtctctgattgacatcaaaacttaaaatgtacagatgca 
ggaaaatcatgcacctgagtgaaaatccaactcagagatgattccaagatcaaattcgcgac 
gaaatttttattccctttctttgggcaataagaaagttgtgaaaaaaattacacagcaggtt 
tagtttcatgtaattatttccacttgacatactttgcctttatgtatttggaattcctcaga 
aaaaaaaaaaaaaaaaaagggcggccgctctagag 
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acagtttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccac 

gcgtccgggatcaagaccctatattgcgttatggtggaatgtatgctttagcattggcttac 

agaggaactgcgaataataaagctatccgtcagttgctgcattttgctgtatcagatgttag 

tgatgatgtccgccggacagcagttttggcacttggatttgttatgtattctgagccagagc 

agatgcctcgtattgtatcgttgttatcagagtcttacaatccacatgttcgatatggtgcg 

gctatggcagtaggcatttcttgtgcaggtactggtctgagtgaggccatctcattgttgga 

gcctttgacatcagatgtggttgattttgtacgtcaaggtgctctcatagcgatggccatgg 

tgatggtccagataagtgaagctagtgattcccgcgttggtgccttcaggcgacaactggag 

aaaattgtcctagataagcatgaagataccatgagtaaaatgggtgcaattttggcctctgg 

tattcttgatgctggtggaagaaacgtgacaatcaagttactttcaaagactaaacatgaca 

aaattacagcagtcgttggactagctgtttttagtcagttttggtattggtatccacttata 

tatttcgttagcttagcattctcaccaacagccttgattggtctcaattatgacctaaaagt 

gccaaagttcgagtttgtatcacacgctaagccctcactatttgagtatcctaagccaacca 

ctgtagccaccacaacttctgctgtgaaacttcccacagctgttttatcaacatcggctagg 

gctaaggcaagggctagcaagaaagaggctgagaaagccattgccgagaaggcagctggaac 

agagtcatcttctggtgcaccaagttctggggagtccatgcaggtggatactccagcggaga 

agaaaaatgaaccagagccatcatttgagatgttgaccaaccctgctagggtggttccagct 

caggagaaatacataaagtttttggaagaaagcagatatgtgccagttaaatcatcaccttc 

tggatttgtgcttctgagagatctacgtcctgatgaacctgaaatattgtccctcactgatg 

caccctcgtGaactgcatccagcactggtggtggatcaactggacaacaggccccggcatca 

gcaatggctgttgatgaggagcctcagccaccaccggcatttgagtacacatcgtgatttat 

ttgtattttaaaagcttcaccaatactttggttttcattccattttggagacgatgttgaat 

ggcagaggtggaaacctatggatcaaatagcacttcctatgatcgagttgaattgtgggata 

cattgaaaagagccccgtggatactgttattctgcctcttgatttccagacttgtgcttgtg 

cttgtcattgtatttcctatgcaagagggactcaaaaactggggactggaaactgccattgc 

gcgttatctttttctgaatctgtcacgtcagctctgtctggactgttagatttttactttat 

gttctaattaagattttatattgttcggatctacaaaaagatttccactgttctccccgagt 
atttatagtcc 



FIGURE 4 (continued) 
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ccacgcgtccgtaaaaccctgcggcctatcggtttatcttctccctccatttccactgtacc 
atacaatacaatggccaccacttcccttagaacgcccaccactaccgtaaggccgccgtcca 
cccccgtcagcgcctccgcggtgaaacccaattgtatcactttcttatcctacttacaccgc 
agacgggggcagactgcactacttccccgacggtgtcgtattca'ccactccgctaacacaat 
tgttcagctgccgcaccggtcggttgagaaatttatagtctttgcttcaaatggggatgctg 
ctgaggccgctcaaaccgagactcaggaacctgagcaggaggtacaagaatccgagcaggag 
gagaatgtagatggtgctgctgctgaagatgcttcggatgagggtgacaatgcagctgcaga 
tgaaactgcatcattcattgcaacttcattgcagttgtacagagatgctttagcaaataatg 
acgattcaaaagttgcagagatagaaatttccctcaagt cca tagaagaagagaaaattgaa 
cttcagagaaaagtagcctcattgaccgaagaactgtcaagtgagagcgaccgggttcttag 
aatcagcgctgacttcgacaatttccgtaagagaacagagagagaaagactttctcttgtga 
agaatgcacaaggggaagttgtcgagaaacttctatctgttctggacaattttgagagggcg 
aaaatgcaaatcaaggtggcaacagagggagaagagaaaattaataatagttatcagagcat 
ttctaaacaatttggggaaatccttggatctcttggtgttgagactgtggagacagttggga 
agccattcgacccattgctgcacgaagctataatgcgtgaggattcagaggaatttgaagaa 
ggtgttgtattagaagaatatcgcaaaggtttcaaacttggagacagactcttacgtccttc 
aatggtgaaggtgtcggctggcccagggccggcaaagccagagacagcggagcctaaagaag 
agcaaaacgaagtcgaggagaagagtgaggaaggtactgctgaaacagcaggtgatgaaggg 
acaggtgaaggaggtaactaactaccagtgatgatgtgacaagtgagggatgtaacctgtga 
tttctcttttgtacaagcaaagaaaaggacatatttcctggtttgattgaggttgagatagg 
tttttgctggtatacctttcaattttcattaactactgtttatctgaaaggacatcatttta 
ggtcagtcggcttatgactgctgtcttaaacactattttttgaggctttggatagttgagga 
ttcatatagtcgatcccaactagcttgggatcgaggcgcaattgttgtaatactccggaaac 
aagagcgtaatgtcatatgccagactgaca 

SEQIDN077 

ccacgcgtccgaaatatgccagggattatttctcgtccgagatgatgttgatgggcaattac 
tacaataaccttggaatgaatttcaatgtaaataataatggcggcggaggaggagggatgtt 
gttttctgggaatccaagtgcgatgacaaacagtggacggagtagcatcaataattcagtaa 
tgagtcagtctggaggttgttcgagttctttttttatcgattcagtgccggggctcaagcat 
gatactgggctggcggtagagtggacccttgaggaacagtacaaactggatgaaggacttat 
caagttcgcgaatgaacccagtataatgaagtatattaagattgcagccgcgctccgtgaca 
aaactgtacgtgatgttgcattaagatgtaggtggatgacgagaaagcgcagaaaacaggag 
gactatagtttggggaagaaagtgaaagacaggaaggataaatcagcagaagcatccatgaa 
aactggtacatcctcagcttcgccattgagcttcattccatattcactctcctcaaatcatc 
gtaaccatggtgaaaatatcccttctgcagcattacttggaacgagacatctactggaagaa 
aacaatcaggctctcaatcagatttcggccaacctttcaacagtcaagttgcaggacaacat 
tgatctcttcatccgaacgagaaataatataacggcagttttaaacgacatgagaaatatgc 
cagggattatgagccaaatgccaccccttccagttttgttgaatgaggaacttgctagtagt 
gttttgcctagtatgactcagccgatgatgtttggctccacaagtggaatccagctgaaaca 
agagccaggctgctgatgcaaaacgcttggtgttaaatttggattactagcttgtgtaagta 
caccaaattttttgctgtaaatgcataaaaagctggcaggtctttgcagcttgggtatacga 
ctgggttccacgggaagaacatttatgagaacctgttttttggaagctgaacatctgaacac 
aagcaccaggaaatagcagcctcgtgttattgcatatcaggggaaaaactgttatcttgata 
ctgcacttacaagcatttttcttcttcttgtttcagccttctgtgtgtaaatttaggggata 
aatcgatctcaaaatcgatt 
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ccacgcgtccgcggacgcgtgggcgcgaactcagattctcaataatggcggcatcgtacgag 

tacgaagacggaggtttccaacagcaaccggattcagccgggtacgacccgaattttgtgcc 

ggattcggtaaaatcgttcgtggttcatctgtacaggcacattagggagaaaaatgtttacg 

agattcaccagatgtacgagtcctcttttcagactttaagcgagcgtatgttcaaggaaact 

ccatggccttcagttgatgccgttgcgccttacgttgataacgaccacgttttctgcttgct 

ataccgtgaaatgtggttccgtcacttgtacgctaggctttctcctactcttaaacagcgga 

ttgattcttgggataattattgcagcctttttcaggttgtgctgcatggtgtggtgaacatg 

caattgccaaatcagtggttgtgggacatggtagatgagtttgtataccaattccaggcatt 

ctgtcaataccgtgcaaagatgaagaacaaaactgcggaggagattgcattgctgaagcaat 

atgaccaggcttggaatgtctacggtgtcctcaacttcttacaagcccttgtagagaaatct 

acgataatccaaatattggagagggagaaggaaggtcttgaagagtttactgctactgatgg 

gtatgattacagtggtggaagtaatgtcttgaaggttttgggttatttcagcatgataggct 

tgctcagagttcattgtctgttgggtgattatcatactggcctgaagtgcttgcgtccaatt 

gacataactcaacaaggtgtttacaccagtgttattgggagccacataaccacaatttatca 

ctacggctttgctaatcttatgttgaggaggtatgtagacgctatccaagaatttaacaaaa 

tccttctatatatttataagacaaagcagtatcaccagaagtcaccccagtacgagcagata 

ctgaagaaaaatgagcagatgtatgctctgttggccatatctttgtcactgtgccctcaagt 

gaaacttgttgaagaaactgtcaattctcaattaagggagaagtatggtgagaagatggcga 

gaatgcaaagatatgatgatgaggcttttgccctctatgatgaactcttctcatatgcatgt 

ccaaagttcattactccctctgctccaagttttgaggagcctcttgtaaattacaaccagga 

tgcgtataggctacagttgaagctcttcctttatgaagtgaagcagcaacaattgttagctg 

gtgttaggacctttttgaaagtctattcaacaatctccctggggaagcttgcaaattacatg 

gaagtggatgaacccactttaaggacaattttgatgacatacaagcacaaaacacatgctgt 

cgattccgatgggaagataacttctaatgctgatgtggacttctacattgatgaagacatga 

tccgcgtagtagaatctaaacccgccaagaagtatggagattacttcttgcgtcagattgtg 

aagcttgaagggatcatgactgatattgacaggataaagctggagtaagctatcttcctatg 

ttctagtattagtgctagcttattttgagctttcatttttgtactcgaaagcaagaaggaaa 

atgcataaagtggaaaaagtatacattttgttgttccccctctgagactgtgttaccggaag 

ttgttgataaatgaccagttaaatccatttttttctaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcccacgcgtccgcttttccatcagatttcagctcttttactccacagctgc 
agcScLca?tggtggactttctggtaaattcapttggcttctccaacaaagaag^attt 
cLcaagctccaaggtaactcgttcgacactccgaaattatgagccacaattgttacttg^ 

ctctttcacaaagtgggtatgaataaaacccagatcaaaaccctc^ 
attattgttttctcatattgataaaaaccttaaacccaaaattatggttttacaagaaattg 

gS?aStgggtctgacct?gttacatttatcaataaaagcgatttcttgatgaga^ 

?ftacLcfa?taaLcaagtcttgattatcttcgggagtatttgggcagttatg^ 
agctagggttattaagaaagagcctaggctgctttccagtaatctccctaaagtaataccac 

cLata?Ltattgttgcaaaatcttgggttttcgctaggggatattgagacggttttt^ 
caqcqtcctaggtatctgcttaataaccctgagtggcttgagagagtagtaaatcaagcaga 

aaagag?^taLgtac?tcggga g tcacggatgtttcttcatgccattgaag^ 
cgc?tgatgaatcaaaattagaaaggaaattagatattttccggagttttggatg^ctgat 

tctgaLtctgtgcaatggtgcgaaaacttccttactgtttga 

aagaagtacaitgaaatttttcatgaccgaacttgggtatgaacctagttatctggcttctc 
atgcaccactttLaagtacagtatggagaagagggtcaagccaaggaatgaaatcttgaag 
ttLttaaagaaaaccagctgataaaagggaaactaagtctttacactgccgtgtcatctcc 
taaatcacqatttcgtaagaaatatgttcttcctttcaaggagaagatgcctgagttgtatg 

aatqcttqtttaagaggtttattcacttcttgataattttgtactttcattttggtgctctt 

t?caagcatgttgctagtttacctttcattgttgattatacatttatca^ 

ctatgLaactagaaattgaggctagtctcattttcaaatcaactgatgtttcttgttt^ 

gggaaggaaagaagtgtagaaaccagacttgatgtatatgccattgattataaaaaaaaaaa 

aaaaaaaaaaaaaaaag 
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ccacgcgtccgcccacgcgtccgaaaaaaagtaaaccggactattctcgaagaaaattttcc 

ggcgactgtgtaaattattttccagtgaagtttccatatttgtatactcattgtcaattgat 

tgcgttgccgtgttcagattctccattaccgatttggtaattaggtttcgagatcggttgtt 

tatcattcttctattcgttaattcgggttaacaggaattttttgatttcgattcgataatcg 

ggttcaagtatttcagaaagagaaacaaaaaggaaaaaggaggtctaaaatttgtttcaaag 

tgaagggggtttaggttgaaagatgttgaccaccacggttggtttgtgatataatatggcac 

gggttgttacagataaagatatgtcgttttacattggtcgcgaggcttcaaagttgtggaag 

agattttgtgcggagataacaacagaaatcaatcttcttgctgagaattggaagtatattct 

tggcggtttgatttgtcagtacatccatggacttgctgctagaggtgtgcattactttcatc 

ggcctggaccaattcttcaggacgtcggcttctatcttcttccggagcttggacaagataga 

gcttacataagtgaaactgtatttaccaccatctttctaacttttgtcttgtggaccttcca 

cccttttattttcaagaccaaaaagatctatacagttctgatatggtgcagggtcctggcat 

tcttagtcggttgtcaattccttcggatcataacattctattctacacagcttcctggtcca 

aattatcactgtcgtgagggttcaaagcttgccacgcttcctcctcctgacaatattttaga 

agtgctattgattgttcctcggggcgtgctttatggttgtggtgatctgatattttcatctc 

atatgatattctctctagtctttgtgcggacataccagaaatatggaacacgaaggtttata 

aaacagtgtgcttggttagctgttattgcacaaagcttattgattcttgcatcgcgcaagca 

ttacactgttgatgtagttgtggcatggtacacagtcaaccttgtagtgttcttcattgata 

aaacgttaccagaactgcctgatcgcactagtgccttgttgcttccagtgaccaaggatagc 

aagtctaaagaagagaatcacaaactgctgaatgggaattctggagatcctgcagaatggag 

gcctcgaaacgggaagatcgtggaagatgggaaagcagtgcacgttgaagcagtaattaatg 

gtgcatagacgacccactaactgatgcttgcaaccttggtacagagattggtaacaatgcca 

ttacaagttgtgttaatataaatcattcctggtgctcttccaagttcaatagttttggtttt 

agcgtaggatacgaaatcaagtcaaggattgaaatgctatggatgtctccacggtcccctgt 

ggttaaatttaatgttatcaaatgtatcatggaattcattttattttttggttaaaagcaat 

tattttcttatttccaaaaaaaaaaaaaaaggg 



FIGURE 4 (continued) 
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ccacgcgtccgcccacgcgtccggcaacaacagcaacagcaactcttgaaggctattcctca 

gcagagaaacccacttcaaccgcaatttcaaccacagaatcatgctataaggtctcctgtaa 

aaccagcttatgagcctgggatgtgtgcccgtcggctgactcattatttgtatcagcagcaa 

cacagacctgaagacaataacatagagttctggagaaaatttgtcgccgagtattttgctcc 

aaatgccaagaaaaagtggtgcgtctctatgtatggaagtggccggcagaccactggagttt 

ttcctcaggatgtatggcactgtgaaatatgcaaccgcaagccaggccgtggttttgaagcg 

accgctgaagtcttgcccaggcttttcaagataaaatatgaaagtgggaccttggaagagct 

actctatattgatatgcctcgtgaatatcagaattcatctggacaaattgtcctagactatg 

caaaagcaattcaggagagtgtttttgagcaacttcgcgttgtacgtgatggtcagcttcga 

atagtgttttcacagcctgatctaaagatcatctcttgggaattttgtgctcgacgtcatga 

ggagctaatccctagaagattgttgatacctcaggtgagtcaactcggcgctgcagctcaaa 

agtaccaggcagcaacccaaaatggatcatctactgcatctgtttctgagttgcagaataac 

tgcaatatgtttgttgcctcagctcgtcagttggcaaaagctttggaagttccattggtaaa 

tgatctaggttatacaaagagatatgtgcgatgccttcaggtatgcaccttgttctgatgct 

ggaaggtttttattttggcccttttctggaatttggagacattccgcttgtatcaatgtgga 

tatcactacaaattcttgaaatatttgcttctgttagtgcttttaacttcccgaccaggtca 

tggtttgcctgtt.ctgtgcgtgatatctgactcaaccagttctatccaactttcttatcctc 

tccggcccccctctccttttaatctgtcatcttcccgtggaattcaaagcaaagctgaaaat 

gaaggtcgaaattcagatttctagcatgtagcagctcagacaaccaagaggttgtgagttcg 

agtcacccaagagcagggtagggagttattggagggagggagccgagggtctatcggaaaca 

gcctctctacccaccccagggtagggctaaggtctgcacaaactaccctccccagaccccac 

tagtgggattatactaggttgttgttgtaatattggaaattcctactggtaaatctgactca 

tgagttaatgtgtgaagtagacggaatgatgtggcca 
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ccacgcgtccgctgacgcgtgggttccactacatcaagacatctactacactcatctttttt 

gcacttattgggtgtaaatttttgaaacccagttgagaaaaatgagtgtgttacaataccca 

gaagggattgacccagcagatgttcagatatggaacaatgcagcatttgataatggagattc 

tgaagatttgtcttcgctgaaacgttcttggtctcctctgaaacccctttcggttaggccat 

cagattcctttgaatctgatttgtcaagtaaggaaaatcaaactcctttatttgagaattca 

tctgttaatctctcatctccgttacccataaagccacttaaccctaatggggctctggaaaa 

ttcaagactcaagccgaacaagcccaattccaaacagagtcttgatgagatggcggctagaa 

agagcggaaagggaaatgatttccgtgatgagaagaaaatagacgaggaaattgaagaaatt 

cagatggagattagtaggttgagttcaagattagaggctttgagaattgaaaaggctgagaa 

aactgttgctaagactgttgaaaagcgaggaagggttgtggcagcaaagtttatggagccaa 

aacaaagtgttattaagattgaagagcgtatatcaatgagtgcaagaacaaaggtggagcag 

agaaggggtcttagtttaggaccatctgagatttttactggaacgcggcggcgagggttgag 

tatggggccatcagatattctagcagggacaacaaaggcacggcaattgggaaagcaagaga 

tgattattactcctattcagccaatacaaaacaggcgaaagtcgtgtttttggaagcttcaa 

gagattgaagaagagggaaaaagttcaagccttagtcctaaatcaaqaaaaactgctgcaag 

aacaatggttacaacaaggcaggcagttactacaattgcatcaaagaagaatttgaaaaaag 

atgatggacttttgagttcagttcagccaaagaagttgtttaaagatctcgaaaagtctgct 

gctgctaataagaagccccagaggccggggagggttgtggctagtaggtataatcagagtac 

aattcagtcatcagtagtgagaaagaggtctttacctgaaaatgataaggatgagagtaaga 

gaaatgataagaaacggtcgttatctgtagggaaaacgcgtgtgtctcaaactgagagcaag 

aatttgggtactgaaagtagggtgaaaaagagatgggaaattcctagtgagattgtagttca 

tggaaacacagagagtgagaaatctccactaagcattattgtgaagcctgatttgcttccgc 

gaattaggattgctcggtgtgtgaatgagactcttagggattctggacctgctaaaagaatg 

atagagttgataggcaagaaatcgtttttcagtagtgatgaagataaggagccacctgtctg 

tcaagttttaagttttgcagaggaagatgctgaagaggaataatgtgtaataaagggagctg 

ctaactcttttcatgctctttcaattttcaatcctgccttttaatttttgttcattcgtgcc 

ttttaattgaatggggaagcattcttttgcttcctcaaactggtattctagcttctgaatta 

cattgtatggtacaatatgaataaggttttgtcttccggcaggttgtccaagttagttttta 
gcttaaaatagatgcggca 

SEQIDN083 

ccacgcgtccgctttcacaaagcattgtgtgttctgatgggatggaactatctggcacctcc 

tgataactcgaagtcatatcagatgttatctgctgatgaagcaactgcaaaccgagatgatc 

tagttttgtggccccctttggtaataatccacaacactatcacaggaaaacgtgatgatggc 

cgcatggagggtttgggaaacaaggcaatggatagttaccttagaggtattggatttcacaa 

tggaaaggtgaaggccttgtataacagagaaggtcatctaggtgttactctggttaaatttc 

caagtttaatggatgccatgcggttagcggaatattttgagaaagataaccgcgggagaaaa 

ggttgggctcgactgcagcccgtgactctaggcaaggatgacgagaacaaccctgaccttgt 

caaggttgatcataggactggagagaagaagagagtcttctatggttatctgggaactgtta 

gtgatttagagaaggttgatttcgactctcgaaagaagattaccattgctagccgatcagat 

tatgtgacatctggttagaaccacttgaatagctttacattaagatgtgcttcagttgagaa 

ttttagtcaattccctgctctagatattctggctttgtgttacttttattgcccttaggaat 

tggggcagctttctctgggataactgtggagctaagttataagtgccatgcatgcgtcttcc 

cctcttctagtgaatctttctgcctagattagcagttttaaagtccaatggactcgctgatt 

gttcttgtccttgtcccgccttcctcggtttgaggctgggtgtaccatttgggtttcgaaaa 
gttcaggca 



FIGURE 4 (continued) 
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SEQIDN084 

ccacgcgtccggccgaaaacaatgggaagaggaaagttcaaaggaaagcctactggtcgtcg 
ccagttctccactcccgaggagatgattgctggtacttccgctcgacgtcctcgcacgttta 
ggcaggaagaggctgaacttgaggaagaagagagatctgaggagtctgaagaggaatctgaa 
gaagattctgatggagagaagaagaaaggtactcagggtattattgagattgagaaccccaa 
tttggtcaagccaaagaacgtgaaagctaaaaatgttgatattgagaaaacaactgagcttt 
cccgacgtgaaagggaagagatagagaaacagcaagctcatgaaaggtacatgaggctgcaa 
gaacaaggaaaaacagagcaagctaggaaagatttagaccgcttggctctcatacgacaaca 
aagagcagaagctgcaaaaaagcgagaggaggagaaagctgccaaagaacagaagaaggtgg 
aagctcgcaagtaacgaatagtaccatgaaatgttgttttcaattctcctagtacaagatat 
atccctaccattattggctaatgatggagtttacacttccacctttcgttcatgtcctgtct 
agtttaaatggagaagagttctctatagaggaaatcatgaaattatactttaagctctgatt 
ctgtacacaaaatagatttgtttggccaatatgatgggaggatttaccagtccttttgttgg 
gttgaaataaggttattgcgactaattaaactatcttgcagtgtgtgtgctatgaggagaaa 
tactttccatggaaaatgtttctaaggaaagtggttttttaaaaacttattttcctgtgttt 
gcttggtgtgtggacgcacgatctttgtatccctgaggtgctttttcaaaagattggaatat 

ataatggtttgagcaggc 
SEQIDN08 5 

ccacgcgtccgctgctatctgatattagaagtatattggcaacgacacagatgtggccttta 
tatgttctcgttgagtaagttttcaaggacagaagaagaatactctcaaggatctggaaggg 
atcgtttaagcttggtagagtccttcctttgtaaacgatcagttggcacaaacaaatgccac 
aatacagtcctctcatgttcgtacttttgtgaagttccacatattaaccagttacattcatg 
ggactgtggtcttgcttgtgttttaatggttttgaggactctcggtaaggattatgatatgc 
aagaacttgaagagctttgctgcactacaagtatttggactgttgatctggcatatttgagg 
cagaaattttctgtcaacttttcctactttacagtcacattaggagcaaatccaagtttctg 
cgtggagacattttacaaggagcaattgtctaatgatctggtccgagttgatatgctattcc 
aaaaggcacgtgatactggtattaatatagagtgcagatcgattagcagtgaagagatttct 
tcattgatcttatctgggaaattcattgcgattgctttagttgaccagtacaagttaagtca 
ctcttggctggaagatattggtatatcagacttctgcaatgacaacccaggctatactggtc 
actatgttgtcatctgtggatatgatgctgatacagatgagtttgagattcgtgatcctgcc 
agttcaaggaagcatgtaaaggtctcctcaaggtgtttagaaggggcccgcaaatcatttgg 
aaccgatgaggatcttttactgatccgtttacagaaggaagagactgaaagcagccctttgt 
gatcgtttatttatttgtgtatgaatgattgtttctctctgactttgtccccgctgcgtatt 
gcccatatcgggtattctttagctgtatgtatattatgtacatcaagggctgtagtatcatg 
aatttcgcttccctgtatcatgaattttgtatatgatgcttggagcacc 
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SEQIDN08 6 

ccacgcgtccgacttgattctctgctctccccttgactttcacacactcaaattcattttca 
tatccactctcaaatccagaaatgcaaatcccccatccccacccctccacccccatctccga 
cgccttcgtccacaagcgcggtaaaaaacgcggtagctacaactgcggccgatgtgggcaac 
ccaaaaaaggccacgtttgccatatctctaacgaccttaataatcatacagatgttcctact 
cccacaccgaccgatgccaaatcatttgttctcccttctccgctatccgttattcgtcctca 
gcaacttccacctccgccacgtcagccacttccccagctccggcgagcgctttctttcgatg 
acgtggatgtcagcgatgatgagtcgcctgtatctgatgatgatgacgtggattgtttggat 
ctggagagtgagttggatttaggtgggtccgggaagttacccgcgagtgctttgtgggaagt 
gcttaagagattgcctccatcagcgctgctttctgcggcgaaggtgtgtaagggttggaggg 
atgtttctagaaggatctggaagtcggctgaggagctaaggcttggagttcctgtgaaagct 
cagattgggcttgttggatcagtgttgcagaaatgccctggacttgttaagctttcacttag 
aatggaaagtgatgtggacgcaacgatgctggcttgcattgcattttcctgccctaatctgg 
attcaatggagatccttacttcagatacctcagttaatcggatcacaggggatgaattaggc 
cgttttgttgctgacagaaggtgccttaccaatctcaagatggaaggctgctcaaatcttgg 
ggcctttactctttcttcaaccagcctttccactctttgcctttcggatctcttttgtcact 
ctaagatggtcttcaactgccccaatttaaaggagatttccctggatttttctcgccaagag 
aaggatagcactgatcttactgctatggtagatggtcttggaaggagctgcccaagactaca 
gaacattcatgttgcatctgttcggcttacacatgctgttgtgcttgctctaacagcagcaa 
atttaaggggattacgaatgctttccctagtactagggtcagaaataactgatgcatctgtc 
gctgctattgcatcgagctactcgaggcttgagttacttgatttgagtgggtcaagtattag 
tgacagtggcattgggatgatatgcaatatatttccagagacattgtctaaacttctccttg 
ctctttgtccaaatatcacttcaagtggcattcaatttgctgcagctcagttgcctaatcta 
gagataatggactgtggaatgaccatatgtgatccagatttagacagtccaacaactcagga 
aaatgataacggcgaattacaaagaacaccgattagtaaattacaccttatatatcagaaac 
tgattatcaaacacaaccgcttaaagaaactcagcttgtggggttgctctggcttagatgca 
ttatatctaaattgcccagagcttaatgatttgaacctgaactcctgtacaaacttgaatcc 
agaaagattgctacttcaatgccccaatctggaaagtgtgcatgcatcatgctgtcaagaca 
cattggttgaaactcttcagaatcaggtttgtggtgattttatggctggagacaatcatttt 
ccatccaaacgtcttcctgatggctcaaaggatcagagttcctcatttattcagcccccagc 
catttgatgatgagaagagaaagagaaggatttcaaagcgacggtgcgcggtgcttgtttat 
tagtcaaatacttgtcttgtattggctttgttgtactctagaccaattgtccattatttgtt 
atatagtgatctgaggctaaggcctgatcatgtaattttcattgattaaactatactcaacg 
tcaatacagggattgtatttcctctatcaataaaaagtacagcagcc 



FlfillRF d /continued^ 
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SEQIDN087 

ccacgcgtccgtctcaatccaaacttcgagttcacatttgccctagctttgagaaatgatca 
tttgcaaactcaaattattgaattagatcataatagaatccgtaatatacccaaattgtatt 
gttttgttttgatgtgtgttgtgaaaatactgtaaattttgaacaattcgattatggccgat 
ggtaaggtggttaagcgtgtcaagtacaaatcttcagtcaaggaccctggcgtttctggcgt 
tttgaagttgaccaaggaacggtttttctttatgccgaatgacccaacatcaacgacaaagc 
ttaatgtggagttcaagttgattaaaggccacaggtcttctaaagagggttcaagtaagcag 
gctcttcttaatctcatgcacgatcagggcaggaattata'tttttgagtttgatagcttccc 
ggaccgcgacaagtgtcgagaatttgttgcctctgcaattgcggtttgtggagaagttgtga 
aagctgcttctgaaaaacctgctgttccacatgatgaacaactcagtgcagcagaaatggga 
cgtcggattaagttactgcaggagaatagtgaattgcagaaactccacaggcaattggtcat 
tggaggtattctatcagaggctgaattttgggccgctaggaagaagctactggaacagggcg 
atatcaagaagccaaaacaacgggtggctttaaaaaacgacatgtggagtgtaaaaccttta 
tccgatggccagacgaacagagttacatttaacttgacaccggaggttattcatcagatttt 
tgctgagaaaccagctgtccgccaagcatatttgaaatttgttccgggcaagatgtcagaaa 
aagaattctggactaaatattcaagagctgaatacctccacagcacaaaaaatattgttgca 
gcagctgctgaggctgctgaagatgaggagcttgcggttttcttgaagcaagatgacatgtt 
agcatttgaagctcgtaagaagatcagaagggtggatccaactctggacatggaagcagatg 
aaggtgatgattacatgcatctcccggatcatgggctacctcttgatgaaactaaggagatt 
ctggaaccacagtatgaaccattcaagaggtcgttctcgcagtacctcaaccagcatgcagc 
agtagttcttcgaggaagagttatagatgttgagctgggtgacacaagatctgttgctgaag 
cattcatcaggacaaatcaggctgaactagctgccgaagtgtctgatgagagtgcatataga 
gaacgcatagctaaagtttctcgagttgctgaaattgaggatcttcagggacctcatgagcc 
accagttgcattgctaagtatcaaggatcctcgggattactttgattctcagcaagcaaatg 
caataaaggctttgggggatgctggtacagggacaagacagctgaaatttagtgtgagcaaa 
gaagaagccttttgctccttgaagaactccatcttcgagataaattcacaaggattgatcga 
accaataattagtccagaagtagctctcaaggttctcaacgggcttagtcagaatatctcga 
gtacaaagtatcatctgggaaagaacccccatgagagtgttttagataggctgcctagtgca 
acgaaagatgaactattactccattggacatcaattcaggaattattgaagcacttctggtc 
atcttatccaataacggcaaaatatttctacaccaaggtgactagattaaaggatgcaatgt 
ctcagatataccccaagttgcaggagatcaaggaatctgtgcaatcggatttcagacatcaa 
gtttcccttcttgtacagccaatgcttcaggctttagatgctgcctttgcccattatgatgc 
agatatacagaagagatctgccaaaagtggggagagaccaaatggatttgcttaggcaaaat 
ttttctccattttcatccgatatttaagctctttgttttctgggggttatatacacgaatgt 
acatttaacaaaattttgttcgagtgtgttatagcatattctatatcttgacagttctaatt 

gactgcctgcggtaattgtacatctagtggaataatggttg 
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ccggtccggaattcccgggatatggtggacccacgcgtccgctctttggatgttatggaagg 

atcaaagtggaatatgactcgaagtggcttttgtggaatgcggtctaaaaagtttgatggct 

tcatcgatttggatggatatgacacgatagcgcttaaacttaaaggagatggaagatgttat 

atttctactatatacacagagaattgggtcaatactcctggaca'agatgaagataattcatg 

gcaagcattcgtttttgtgccaaaagaaaactggtatattgcaaaaatcccgcttactcgtt 

atgtacctacttggagagggaacatgataaatgcaaagttggagatgaatccagctcgaatt 

cttggcatgtctctatctgtcaacgcagaaggtggagttccaggtgccaagtctgggcctgg 

tgatttccaagtggaagttgattggattaaagccttgcggatgcagtaagcaaaaggggaac 

atcttaaagaattattagaataggctgggacatttggggcatccacgctcaccagttgagca 

agattgtggaaatgccattcagagatggagaagatacaggttctttttctatgtaccttgga 

ggaaaagagagaattgagctgaggaaaggagtgaaaccttaaaatgcagtgactacaggcca 

caccaccaagtcaaattatcagatttttttcttgtaataaatggggctcttcaatttttctt 

taggctatcaactagtatggtaaactaagcagtatgtttaataattatatcctcgtctgtta 

caaggtttggcaatcaaataatacaacaatgtgcttggaatcggtagtactgttaaaagatt 
taatgtcaatgtgcaatgcgc 

SEQIDN08 9 

ccacgcgtccgacctcttcacctttacaaacttctacaaatatatttacttcaaacaactga 
gtagtcctattgtttctgattcgatgtcggttagaacagtgaaagtgagcaatgtctctctt 
ggtgcgtcggagcaagatatcaaggagttcttctcattctctggggatattgagtatgttga 
gatgataagtgagaatgagcgatctcaaattgcatatgtcacattcaaggatccccagggtg 
cagaaactgcagttcttctttctggagccacaattgttgatcagtctgtcatagtagccctg 
gaacctgactacgagctgcctcctacagctccagtgccaatcaaggcaactgagagggctaa 
tgcagctggtggtggatctgctattcaaaaggcagaagatgttgtgagcagcatgttggcaa 
agggcttcatcttgggcaaggatgcagttaacaaagcaaagacatttgatgagaaacaccag 
ttcatatccactgcatcagccaaagttgcttcactagatcaaaaaattggacttagtgagaa 
aatcaatatgggaacaactattgtgaatgacaaagtgaaagaaatggaccagaagttccaag 
ttactgaaaagacaaaatcagcttttgcagctgctgagcagacagttagcactgctggatca 
gccatcatgaagaacagatatgttttgacaggggcatcttgggttactggtgctttcaataa 
ggtcaccaaggctgcaggggaagtgggccagaagacgaaggaaaagatggcagaagaagaac 
agggaagaagttcagctgcaggttacgtgcctatacatgctttctcggagtccccaaaagct 
tccaaaaccgaggaacctgccaagccctcttcacctaagggcctaattctctagcttgtgca 
aaaatatttcaaaactattgttcaattccgcttgtctgatcttttagctgtcattgtgttgt 
ggttagacttagatatgctagttatacataaaatgtcctgtacgattgttgatacatggaac 
gatagttgctggactattaaattccctgtcggagtgctgtgcgg 



FIGURE 4 (continued) 
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ccacqcqtccggagagtaaaagtggatcctatgaggaagagcgtcagtttgaacaatctctc 
acaSaSaacagLLatgcLacaacagcgctgatacatctaaagtggctgaggaaggat 
ataSc^gcagatgacgctgttcaacaccactccaacagcggtcgcgagcgtaagcgagga 
atacca?ogJcggaggaagagca.caagttattcctattaggattgcagaaa g tg g ggaaagg 

atactcagaagtacttcctccgacgaagcaacctcaaccgtcgtcgccgccgatctagcctc 

aatLcagttccaccagttgtagcatcatcaccaacattgcctactactatagaggctacca 

aaaccaa?gcatttccagtggcacctatcatgttaccagcacagattgatcagtcaagag^ 

agtccaacLtgttgcaacgaaatcaagtgaattcgtatacgccagttcgccctcttcctat 

grtttcaatgcccaatccatcaacagtatttgaccttaacgtgaaccagatctcagaa^cg 
LccattgtLctgagattatccttgtcacttgatcagggacaagcatcatctactagacac 
cactcggLtttaaagtaatgccaagcttcagtaatggagagagcatcattagtgtggcatg 

agatcgLggatctgtgagaaaaaaatgaaagcaatatggaaagtaaaaatagga^^ 

qqqtacgctqcactcataattatattaagggaatgtttatttaaggagagattaattgacta 

gacatt^ggLctgatttgtacagaccagaaatatgtcatgccttgtggttacctgtttaat 

gcaacgagtatactgac 
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ccacgcgtccgactttttccactgagctccactccaatgtgtaaaaccctagctaaaaatct 

ttaaagttagggtttcaaatttgcaatggggaactgctgcagatctccggcagctgtcgcaa 

gagaagacgtgaagtcttcaaacttctccggcaacgatcacggccggaaagacaagtccagc 

gccggaaaatcgcaaaaaccggtaaccgtgttaaccgatgtgaa'aaattcgaacgttgaaga 

gaagtatttagttgatagagaactaggcaggggcgaattcggaattacatacctttgtatag 

atcgtaacagtaaagagcttttagcttgcaagtcaatttcaaaacggaagctacgaacagct 

gtagatgtggaagacgtgaggagagaagtagccataatgaagcatttgccggtgaattcaag 

tattgtgagctttagagaagcttgtgaggatgaaaatgcggtgcatttggttatggaattgt 

gcgaaggtggggaattgttcgataggattgtggcgcgaggacattatactgaacgagctgct 

gctgctgttacacggacgattgtggaggttgtgatgctttgtcataagcatggtgtgattca 

tcgagatttgaaacctgagaactttttgtatgctaataagaaggaaaattcgcctcttaaag 

ctattgattttggcttgtcaattttcttcaagccaggtgagaggttctctgaaatagtcgga 

agtccctattatatggctcctgaggtgctcaaacgaaactatggaccagaaatagatatatg 

gagtgcaggagtcattttatatattttgttatgtggggttcctcctttttgggccgaatctg 

aacaaggtgttgctcaggccatcttacgtggggtgatagatttcaaacgggaaccctggcca 

agtatttcagagagtgctaaaaatcttgtacggcaaatgctggaaccagatccaaagcttcg 

actgactgcaaaacaagtacttgaacactcttggcttcaaaatgctaagaaggctccaaatg 

ttccccttggagatgttgtgaagtcaagacttaagcaattttctttgatgaataggtttaag 

aggaaagctctgagggtgattgctgatttcttgtctaatgaagaagttgaagacctcagaga 

aatgtttagcaagatagacaccgataatgatggaattgtttcagtccaagaactaaaagctg 

gacttccaaagctcaactcacagctggcagaatctgaagtacaaatgcttgttgaagccatt 

gataccaatggcaaagggaccctggactatggagaatttattgctgtttcactccatcttca 

aaggatggctaacgatgaacatctgcacaaggctttctcctactttgataaggatggaaacg 

gttacattgaaccagatgagcttcgagatgccttgatggaggatggagcagaaaactgcgcc 

aatgtggcgaatgacattttccaggaggttgatacagacaaggatgggcgcatcagctttga 

agaatttgcggccatgatgaaaactgggacagattggagaaaggcttcacgacattattcaa 

gagggagatttaatagtctaagtgtgaagctaatgaaggatggatcgcttaacttgggaaat 

gagtaaggtttacattttttcatcaaaatgaagtattgtatcgatgtgtatttgatctcgat 

gtgtatttgatctctcgccattgttttctggggtgcccattagattgtttgcttgccaggat 

ggaaaaggggcgacttcatctgggtaaccgttgtaaccatttgaaacacagaatgtatcctt 
ctactccc 



FIGURE 4 {continued* 
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ccLgcgtccgcttggacttggttacaaaaacagtagccttaaatagtccagctactgccct 
tgccttgtacatgcaatgcaagtacaggctgccgccatggcgaatttctcaccgttgctatt 
aLgacgatttggctagtgcttgtaatttgtaaaggagtagagagtggtcattcctc^ 



^ggaratccaggaatgataacagatggcttaaggatagctttagaagctt^ 
aa?gaagttggtgaagaagctcctggaatgggtagccctagagctgctgattgctttcatct 

??c?gacagttctctgactcacaaggtaaccgagtcggataataagctaggagttggcaaga 

ca?t?Ltggcctgagtcctaaggctaagaataatccggacttatatgct g ttgaaaaggaa 

ctctatcttggttcattgtgtgaagttgatgacacgccgaggccatggcaattttggatgat 

aatqttgaagaacggaaattatgacacaaaatctggtctttgcccagaaaatgggaaaaaag 

rgcfcc 9 c"?taa?Ltggaagatttccttgttttgggaaaggatgtatgaatcaac^ 

ttqtatcaccagcccacttcattattagccgatgatattatgcggggaggttttaatggtac 

ctatgatttgggttcttcaacgggtggcagtagttccttctttgaggtgctctgggaaaaga 

aagt?ggcacagggggttgggtatttcagcacaaactcagaacctccaaattgt^^ 
ctgatgttgtatcttagggcggacgcgaccaaagggttctctggaggctaccactacgatac 
cagaggaa?gttaaaaactctcccggagtcacctaattttaaggtcaaattgaccttggatg 
tgLgcgagggggaggaccgaagagccagttttacttgatagatattggcagctgctggaag 

cattaa?Lagaaactccagcttggtgcagccccacaaatattggcaactgcccacct^ 
acatcacaccgaacaatactaaaatctacaggaatgacacctctcacttcccttactcagct 
tatcactattattgtgctcctgggaacgccgagcacttggaaaagccatatagtacatgtga 
tccttacagtaatccccaagcacaggagctagttcagttgctgcctcatccaatatgggcag 
actacggctatccaaccaaacaaggagacggctgggttggggatggaagaacatgggagctt 
gacgttggtgccctttccagcagactttacttctatcaggatccaggtacacctcctgctag 

LgLtSggacatctctggatgtggggactgaaattt^ 
agaatggactctgagcgactttgatgttttaatcacctcgtaaagccataataatgatacc 

.ttctatttaacattgtaactgtagccaaagcaaaatcagatagtgggacaaggtctcatca 

ttcttgatgtctaaactttatctttctatactagatctgatctgacggggcaagtcctggca 

gctttatttccgagagaagaaaaaagaattttgtttttgctttaaaaaaaaaaaaaaaaaag 

ggcggccgctctagagtat 



ca 
c 
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ccacgcgtccggcttcactatcttgagctggccatattttcatgcttgcaggaactgattta 
ctgctttcgctgggtggcgtttttcaagtttaaagtttaggtcaaactcagaggattcgtca 
tagtttacatacttttctgctggtcattaacgaaactatgtgtactgaaacacgggggtgat 
tcaattcttctttgtcacagatattaaagctggactaaagcatctacagactggactctatc 
ctgtagttagatatgcagactaagaaaaaattaaatggaagaaatccccgagagctggctag 
tccaaaggtttcaagacagcagcggaagatgtccgagaatgtgcaaactcaggcaaagcaag 
ttaaggaacttataacatctacagtgaggaagcaaaaatcaggaagcaatttcttgaaaaag 
attgagaattatgttgctgctacagatctggatgtaagatttggattggtgtctgatgacac 
ttctgctgcttcagacgcacatgatgctgttcatgaatataatactattactattaataagg 
actataatgttgaaactgatagttgcacaaatgatactatattttctcctaccttccatata 
tccagaactattggaggggaaatttctaacagagcagacatacccaaattcattgagcaagc 
agaccagccattgcaggagcctggaaaggaaaatatggaagttgatctgctgacaagtcatt 
ttgtgctggacgaggctaccgatatagggggccagcatatctcctctgaagtttcagctgtg 
catctctctattaaagattcaaaactggaatgcattgatgaatttaatcaatttcagttgcc 
tgctgatgttagtatggaggaagaggaaactgaagagtttgatgactttgatccatattttt 
tcataaagaatttaccagacttgtactcagttgttccaacatttcggcctgtgctattgcct 
aaacaaacacggagttgcccatcaaccactcttgttttggacttggatgagaccttggtgca 
ctctacacttgaaccttgtgatgatgcagatttcactttctcggtgaatttcaacctgaaag 
atcataatgtatatgttcgatgccgtcctcatcttcgggattttatggatagag;tatccagc 
ctatttgagattatcatatttactgcaagccaaagcatttatgctgagaagcttctgaatgt 
gcttgatccaaagagaaaagtatttaggcatcgtgtttaccgtgagtcatgtgtatttgttg 
atggcaattaccttaaagatctgtcagttcttggccgtgatttagcacatgtgattatcatc 
gacaactctccgcaggcatttggattccaggtggacaatggtattccaattgagagctggtt 
tgatgaccgctctgacaaagagttgctctctttgctcccatttctggaaagcttagttggag 
ttgaagacgttcgaccgattattgctagcaaattcaaccttcgcgagagaatagctgctgct 
gctacttgtccttttaactctattagaggtgatgcatttgagagatagggatccgtgtcttt 
atagattcagtcttggttacttgaattttagatttcaatggctctcgatgagttgcaggaat 
cagttctaatgtacctttgcggatgtgagtttgctagaggctgatctctaatgttggttaat 
ttatgtaattcacatttatgtaatggtgccataacgacgcttgagattggaggaaacttctc 
aataaggctgtatctgaaacgtgaaatcatccaagcgag 



FIGURE 4 (continued) 
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ccacgcgtccgcttagggttccaaattgctctaaattcccgcggattgagagttcattggag 
acttccattgttcccagcggctaagatgagccggttgattgagcatcacctagcaaataata 
aacaggacatgaaagggacagaggtttttgttggtggtttggcccgtactactactgaaagc 
aaaattcatgaggtattttcttcatgtggtgagattgtggaaat'acggttgataaaagacca 
gacaggcgttcctaaggggttttgctttgtacgatttgcaacaaaatatgctgctgacaaag 
ctctgaaggaaaaatctggatatgtgctggatgggaagaaactcggggttcgcccctcagtt 
gagcaggacactttatttcttggaaatcttaacaaaggttggggtgcggaggaatttgagag 
tattgtgcgccaggtttttccagatgttgtatctgttgatcttgcacttcttggagatgtcc 
aacctggtcagaagcaacggaatcggggttttgctttcgtgaaattcccatctcatgctgct 
gcggctcgtgcttttcgggtaggctcccaatctgattttctcattgatggcaagttacatcc 
atctgtacagtgggctgaggaacctgatcccaatgaacttgctcagatcaaagcagccttcg 
ttagaaatgtacctcctggtgctgatgaagattacttgaagaagctctttcagccctttggc 
aatgtagagaggatagctctatccaggaaaggtagctccaccattggattcgtttacttcga 
taagcgatctgatcttgacaatgctattatggcgttgaatgagaaaactgtacaagggccaa 
tgggaggtccctcatgcaagcttcaggtcgaagttgctaggccaatggacaagaacaggaaa 
cgaggtcgtgaggatccaaacatgtccagtaccattgagagtcattccaagcttttgaagga 
tgatccagatgttgagatgattagggctcctaaatcaactgctcaactggagatggattatt 
cggatccttatgaagctgctgtagttgcattacctgtggttgtcaaggagcgtttagttcgg 
atcttgcggcttggtattgctactagatatgatatagatgttgaaagtttaaccagtcttaa 
gatattgccccagtcagctgccatatctattcttgaccagttcatgttgtctggagctgata 
tgcagaacaagggaggatatctagcttcattaatttctaagcaggttgaaaaactgggaccg 
aaacaattcgatagtaggtcaaggatagaagatgttggcttgagggtgccagaaccagacag 
gttctctacaagagttcgtttgccagatctagattcatatgcctcacgagtacccttgccca 
tgcctaggactgatgtttacacatctcactattcagcgtatttagatccccatctgtctggt 
cggatgacagcaaagaggatggaggaagcaagttcccatttgcaggcgacttcacttctgtc 
tagtcgggtggcaacgaggatggaggaggcaggttccactttgcagtcgctcctatctggtg 
gggtgacgacaagaaggatggaggaagcaagtccgattttgcaggcaacactccttccatct 
ggtcgggtatcaaggatggatgaagcaagtcccaatttgcaggcaacatggagcccttctcc 
tactaatgacagaattggacttcattcacacattaccgcaactgctgatcatcaacatactc 
gaccacggatcaggtttgatcccttcactggtgagccatacaaatttgaccccttcactggc 
gagccaattgttcccaagagctcaagtcatcatcgaagcctgtactgaacgttctgagcatt 
ctaatttacaaatggcttattgccaaacctatgtaacataatgatgcgtatttttgttcatc 
cgcagctgtaaaatagtagctgttagcaggattatttggttatgtttctcattgacttcatt 
gattgcgaaggtgcatttggaatctcggcaatcacaatttatagccggtgca 



WO 03/085115 



57/140 



PCT/EP03/03703 



SEQIDN095 

ccacgcgtccgcccacgcgtccgcaaaccctcccgcgaagctaaatttcccctttttctctc 
tctctttgattcgaataagagaaattgggggtttacagtaattggggttttcgtatatttag 
ttctgtaaactcatccctcgactcgattcttcttttgatttgcattgattatcattagattt 
gactttgattttcaattcaattctaattgatggaggatactaatcagcagaacgtcgatcga 
ttcacgtctgtttcatcttcaggtgaaagagccgttgagccacataatgctgcagaacagcc 
tatttcgcccaaagatgaaaggactgtttctgcaaatgcttctgtgaatgcaatcatccctg 
gggctttaagaaatgctaaagatcaccctgttacctctgaaactggagctctgtccgccttt 
tatcctctcaattcctattctcctcaggaccaaggtttttactatggaggttacgacaacgg 
cactgggagttgggccgaacaatccaatgatgtcaatgtgaacttgcatgtagttccgccag 
caatgtacaatgagaatcccctctttttccctccgggttacggctttgatgctcagatggca 
tttggacagttctcccccatcgctagtcctctttctccttttatgatagatggccagctata 
ctccccacaccagattccggtttctccaaattactatgcaccacctatttcccctggcttgc 
cgcatgttacatcagctcttccagcttcgcagcctgatctggtggcaccaggaagcactggc 
catgaaattgatagcatgtattttgggccaggatcaggttactacatacccgttggatcgtt 
tggcggaggcgagctttcgggaagcagcaacattggtttctacaattaccaaggtgaatttg 
gatctggtcaatctttacctaatcgacctaaccccctggactctggaagatacatgtctcaa 
atgacatctgcggcactatatccacaaccagttggcatacttgggtcgtacgaactaaacgc 
catgcaggcttcacatcaaggtcttggattcacaccaggctcctcaggcaggaattattccc 
aaggcaatccttatcctagtgcaaactatggtactgggtctagttctctgtgggaaccaggt 
cacagaaattggctaactcctgacagaggtggaagacgtgagagggatcggcactctgttaa 
catttctactgaatcacttggtatggcaagtgaacgaaaccgaggaccaagggcattaaaac 
caaagagcaagggtcccattgaggatagctcttcatctgtcatccgtaaagaagttgagtcg 
actaatactttgcagcctgagcagtataatcggcctgaatttgtcactggttatgaacatgc 
caagttctttgtcatcaaatccttcagtgaagataatgttcacaaaagcatcaaatatagtg 
tgtgggctagcactcctctgggaaatggaaagcttgatgctgcttatcgtgaagcaaaagag 
aggaatgctgattgtcctgtttttctctttttctcggtgaatgctagtggacaattttgtgg 
ggttgctgagatggttgggcctgttgattttgagaacaatgcggagcactggcagcaggatc 
gatggagtgggcaatttcctgttaaatggcatgtcattaaggacgtgcctaacagtcagttc 
cgccacctacttctggaacataatgacaacaaaccagttactcacagtcgagattctcaaga 
ggtgaaattgtcagagggactagaaatgttgaaaattttcaaaaactatgaagcggatacct 
ctatattggatgatttcacctattacgatgagagggagaagtccttgctggaaaagaagagt 
aaacagcgaacacttcaacctggtagtgctgcagttactactgcagctgacacaataagtca 
actagcggatagtcttgccggcacattaaacttggaaggcaacaagaaattgccttaaaaag 
agtttgtaatgcttaagcctgtagcagattccagaggcaatatcaactgctgtcattcaatg 
ttagttggttgggcaaccagctggctcgtatagttaaggagattgctaacagcattttggga 
ggtggccttttcatcacctttggatcagaaaactctctttatctctttcctttatttgtcat 
ttgttcaatgagttggttgtgatttgcaattcttgggcaaggagaggcagccagtaatgtta 
tgagctatcgtttaccc 



FIGURF A /continued) 
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ccacgcgtccggggttggcaagaaaaattctaaagagaaaaggaggaacacattgattttac 

gaggactagtcattcatttttcttggacagtcttggaaactaacagcttgattgctgaccct 

ttctcctatcacggctagacagccaaactccacattcttatataaagaccacccttttcatt 

ttggatttggtaaaacaaaggaagtccagaagataatcagagaaagatgaaatttgggaaag 

aatttgcatcccaaattgtccaagaatggcaagaagcctatgtggattacaattatctcaag 

agtgttttaaaagacatcttgaatttcaatattgccgcttcacctgaagttgaaggctcctt 

aaagagaaggctatctatgtacagagcctttagtggattacaaactagtttcaaagtttctc 

aaaacaatgaagatgaagccatattagtgagttcagaaggccactatcaaactatgtttctt 

atgtcatctgaaaaaggtggagaaaatgagatggttttctttaaaagacttgatgatgaatt 

caacaaggtgataactttttaccagaaaaaagtagaggaagtgaaggctgaggctgatgagt 

tgagtaaacaaatggatgcacttattgctctaagaatcaaggttgataagccttctataaga 

atcaaaaattcccatttgggaaatccaggtaggtcagaaatggaggcaatacaagaagcaga 

gatgacaagtgaagaagaagaagcaacaagagggaaaagagatacagcaaatacaaaacata 

tggaatttaggcctgctccactagagattttggaccatataaaaatcaatcttgaacccgaa 

acacctgtctcaactttaaaaaatatcatccatacttcaaaatccaacttatcattcagcaa 

agaggagctcagaaaagctgaagaacaaatgagaaaggcttttgttgagttctatcaaaagc 

ttcgacttctgaaaaacttctgtctcttaaatgtgttggcattttccaagatcatgaagaag 

tatgacaagatcacctcaaggaaagcttctaaatcatacttagagatggttaataaatctta 

tcttggtagctctgatgaggttgctaagctcatagaaagagtggaggccacattcataaagc 

attttgtcaatggaaatcgaaggaaaggaatgaaatctttaagaccacaagctaaaagagaa 

acgcatagagtaacatttttcctgggtttgttctctggcggctcaatagcattagtggcagc 

tattgctgtatccatacatttcggaaaccttctacagcatgagggtcgtgggcagtatatgg 

aaaatatatttccactctacagcctattcggatacattgtcctccatatgctcatgtacgcc 

gggaacatatactactggaggcattttagagtcaattatcccttcatttttggcttcaagca 

gggaacagaactaggttacagacaagttcttttccttgcttctggtctttcagtacttgcat 

tggctgctgcattgtcccacctagatatggagatggatccaaatacaagaagttttgagaca 

gtgattgagctgatcccacttgccgtggtgtttattctgcttctaataactttttgccctct 

gaacatcatatatcgttcaagtcgcttcttccttataagatgtggttggcactgtctatgtg 

ctcccctttacaaggttaatctaccagatttttttcttggcagatcagcttactagccaggt 

tcaggcaattaggagtttgcagttctatgtctgctactatgtgtggggcaacttcagaacaa 

gatctaataaatgtcaagaaagcagtgtttatcaaatcttatacatagtcgtcgcaattatt 

cccttttggtctcggtttattcagtgccttcgccgcttatttgaagagaaagattcaatgca 

ggggcttaatagcctcaaatatttctcaaccattgttgctcttgtgatgaggacactttatg 

ctcaaaagagaggaacgttttggagagtaatggcggcatcatcctcaggaattactacagtt 

gcaaatacttactgggacattgttatagattggggtttattgcaaaagaattcaagaaaccg 

ttggttgagagacaaactgcttgtgccacacaagattgtctactttgttgccattgttcttg 

acattattctgagactagtatggatgcagttggttcttgattttcaagaactaccatttctg 

cacaagaaaqcaatggttgcagtagttgcctctctagagatccttcgccgaggcatgtggaa 

ttttttcaggttggaaaatgagcacttgaataacgtcgggaaatatcgtgccttcaagtccg 

taccattgccttttaactacgaggaggacaagagtctatatctatacctctagctgatacgc 

agaagtcgaaggaatccagggttttcttttctttctttttttttcttgcacaaattcttctg 

attcgttgccgtatattggt 
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ctgtaaaaaagagtgcatcggcaatggaagcattcgagaagcttgagaaggtaggggaaggt 

acttacggaaaggtgtacagagcgagagatagggttactggcaaaatcgtagcactgaagaa 

gacgaggcttcacgaggacgaagaaggtgttcctcccactactctccgcgagatctctcttc 

tgcggatgctctctagggatcctcacattgtcaaactgatggatgttaaacaaggccagaac 

aaagaaggaaagacggttctctacttggtctttgagtacatggatactgatgtcaagaaatt 

tattcgtagtttccgcgcaaatggagaaaacattccccctaaaactgtcaagagcttqatqt 

accaactatgcaaaggagttgctttctgccatggtcatggcgtgttacacagggatctgaaa 

ccacacaatcttctgatggaccgtaagacgaatgtgctcaaattagcagattttggacttgg 

cagagcttatactctgcccatcaagaagtacacgcatgagatattaaccctatggtatagaq 

cccctgaggttcttcttggagctactcattactccacagcagttgacatgtggtctgttggt 
^^ttgctgaactggtcacaaaacaagc^^ 

actgcttcacattttcagattgctaggtactcctaatgaagaactctggcccggggtgagca 

agctagtaaactggcatgaatacccccaatggaacccccagccactctcaactgctgtccct 

ggtctagatgaagatgggctccaccttctaactgagatgttgcattatgagccagctaagag 

gatttcagcaaagaaagctatggaacatccctatttcgatgatttggacaaaactcctctct 

gaagtcccgctcatgacccatctgttgaaaaattgcaaatttctcatcaccggagatcaaca 

aacccatctaacccctcatcgcaagcttttattgcttttctcaagcatcttttaatagtatc 

aattagtatgactagcttcacctaaaaactttgtctttctatatcaattggatcagtgtagc 
acaattatgtggaatgatagaaccgca y y 

SEQIDN098 

ccacgcgtccgatttcctcggctattttctgcactgactcacgatttttcggacgctttgtt 
ctcgccgtagcgcggatattatacactttgtacaatctctgtagtgatcgccgattgatttq 
^r^ 9 ^ gaagtt9tCCttgCCgaaaattttCtCtCa ^ tc ttgtgag g cagg?ggctg 
cagttgttgtaaaggttgaagtagctctagacaaaagcatttgcatgttgaccagatgagca 
gaactgatgttatttgcagtagaaggaggaggtttcttctcgtcttcagcttctggatataa 
taagggcctgacccttctactcttgggtcagaagaacgaagagaagcccatgagagttgcac 
cgtggaaccagtaccagttggtggaccaagaaactgatccggacctccagctggcttccggq 
aagaacagggttgtccgcgggtgcgcctcctttgtatgctttggtcgcgctgccgctggact 
tgagagcccatctccccttaaagtcggtcctacccaacagccagaagtcttgcctagctgtc 
ctgcttctgacaaggacaacaatcagtcgcagtgtgttaatattattgaagacagtcatatc 
tcaccaaaggttgctcttcggagtagcttaaagaaaccagcaaatagtatacccatttctgg 
tggtaauggtaatgaacgcggcacaaattctctaaagattgatgatgcccccaatcctatgg 
agaaaaggaaagtgcagtggacagacacatctggaggagagctttttgagataagggaattt 
gagcctagtgatgatggtgaatcagatgatgaatttgagagtgggaatgaaagaacttgttc 
ttgcaagataatgtaattttagctccttatagaaggtttcatggctgattttcatgtaggag 
gcaacaattgaggtgctgcagatgatattacgtgggaggttggtttcgacacatgccttttg 

t J ag ^ gacacaagt 9 aat tttgaggctttattaggtagcgtcacgttgatcat 
gggcttttctggttttggctgactaatctgttgtctatattcaaattcttgaatgtcagttt 
tgtttctctggggcccgctcgtctattgttttcataattatattttatcttcatttttttaa 
ttaaaagatttctggtcctcttttatccaaaaaaaaaaaaaaggacgaccgctgtagggt 



FIGURF A Ir.nnfi 



WO 03/085115 



60/140 



PCT/EP03/03703 



SEQIDN099 

ccacgcgtccgagcatatattcttcttcttcttccttttggcttctatttcattagcttata 
aaaaccaaaacacaaacccacaaacaaagaaacgttaattagctttgtggagtggagagacc 
ttttattgaaagggagcttaactgttggaccagctgagcactacttgccaaagatttgaact 
ttcttggttttttggactggaagttagataaagctcaaatcttttgggtttattttgtatct 
ggtacagtttcttgaacaagaatagcaggaccttcaaggctgaaaagggaatattttcggct 
tttttggagtgatttgttggatgatcaggaaattgttttgtatttgagggaagcagttaata 
aggtagaatagaaatgatggatagggtgccaagattgcttatggaggttctaacagaaccgc 
aacgaggaggagagtctttactcgggtcgattaagattgctgttttaccgattgcaaaagtt 
tttaccatgtgcttcttgggatttcttatggcctccaagtatgttaatattcttccagctaa 
tggacggaagctcttaaatgggttggtgttttcacttttactgccctgcttgatattctctc 
aacttggacaagccatcacatatgagaaactgcttcagtggtggttcatccctgttaatatt 
gttatcgccaccatatctggctctattataggttgcatcgttgcttcaatcgtccgtccacc 
atacccttatttcaagtttaccgttgtacaaataggcattgggaatattggaaatgttccac 
ttgttctgatagctgcactatgtcgggataaatcaaatccttttggagactacgagatatgt 
tcgcgagatggaaatgcatacatctcatttggccagtgggttggtgcaatcgttctctacac 
ctatgtgttcaatatgctcaaacctcctagtgaaggcactttcgacgttcaagatgcaaatc 
ttcctatcaaaagtcctaacaaagatggctcgcagagccatctggtagctagttcaccagag 
caagttccattacttacaacagacgtagcaccggctgactcaagtggttcaaagaaagaaaa 
ggttaaagagttctttaattttctatatgagaaactgaagctcaagcaaattattcaacccc 
ctattatagcttctgtcctagccattgtcataggatgtgtgccaatcctgagacgacaggtc 
tttacttctgatgctccactttacttcttcactgacagctgtttgattcttggggatgccat 
gattccctgcatattgctggctttaggaggcaatcttgtcgatggaccaggacccggaagtt 
caaaacttggcctaaggacaaccgctgcaattgtctttggacggctggttttggttcctcca 
accggacttggcattgtcatgttagctgataagcttggattccttcccgctggtgataaaat 
gttcagattcgtactcctccttcagcatacgatgcccacatccgtactttctggtgctgttg 
ccaacttgagaggatgtgggaaggaggcagcggcggtattgttttgggttcatatttttgct 
attttctctatggctggatggatcatcctctacctcaacatactcttttaagttaggatcaa 
acagtgttgctacaaagagtaaaaagaagagatcttgggatggaaggtttttattcctgtta 
ccaggatcgcgccagcctttcgtaaagctgctgtttttagctcattcaattgcctcattgcc 
atttgagactaagagagagatgtattaatattatgtaggaatattacctactacatctataa 
gtataattagtcatgatggagttaaccaattgctccttatttgttcttggcttcttctactg 
tataaccttagcttatgctaccttgaaactggctatgtcaaagttggacttggcatttggca 

gacaaagatgagacatgatgttcattggaagaataagtaaacgttgaacagc 
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ccacgcgtccgagcttaagaaaagaacacttgccctgctgagtatctataaccatatagata 

ccagtaccataattttgcatcatcttcaactctaagggatcaaagctttattgtccaaagaa 

aaaacttatggactcccctcataagccaaggtctttttcacctaacctattctttttcttcc 

ttcttgtatcctcaaatctcctgacttttttcatttctaacatatttaagaactcttcttgt 

tctctataccagcaaacatataaagccattgccactgcttcattaaacaatgctatcccttt 

tgttgttaagtcagaaactagagatgttgttcatgtatctgataaaccagcagctgatttag 

acttaccatctgagttccttgctttcacatctccacatcaactgccatttggagtcaactcg 

agctttaattctgacaaactcatccctcccgttggccgtccatgtactatgtttccagattt 

acttcggcgttacatgtcatacaaaatcaatggatcttgtcctgatgatgagctcctggcac 

agaagctgcttctcaaaggttgtgagcctctccctcgccgtagatgccgtcctgcagctcaa 

caggaatatgttgagccttatcctcttcctgagagcttgtggattactccatcagattcatc 

tgtagtttggacagcatatacatgcaaaagctatgaatgtctaatcaacaggaaaaaaaatc 

aaaaagcatttgatgattgcaaagactgctttgatctcaatggcagagagaaaaatcgttgg 

ttgtccaaaaagggagctggccttgacttctctattgatgaagtactagcagtgaagaagcc 

gggtacaatcagaatagggcttgacattggtggaggcgtagctacatttgctgtaagaatga 

gagaaaggaacataacaatattaacaacttcaatgaatctcaatggtcctttcaatacattt 

atagcatcaagaggagtcgtacctttgtacataagcatttcgcaacgacttcctttctttga 

caatacactggatatagttcactcaatgcatgtgctgagtaattggatacccacaactctgc 

tccacttcttattattcgacatctatagggtgcttcggcctggtgggctgttctggcttgac 

cattttttctgtgttggtgagcaatttgagcaagtctatgctcctcttattgacagcattgg 

gtttaataaggtcaagtgggtcgttgggcgaaagatggacagaggccccgagctgaatgaga 

tgtatctttcagcactgttggagaagccactgaagaactcttggtgatagtattagatgttt 

cttttactttcttacttttgatagttatagaagagaagatagaaggaggtgtttattttttt 

taaaattatagattcatttcagatgacatcttctggcataaactagcagtttgaggtagctt 
gagtatgattttgtaatgttggtgggctaaaccttagagctttagcggcc 

SEQIDNO101 

ccacgcgtccgtgatgctgttcatgtatctgataaaccagcagctgatttagacttaccatc 

tgagttccttgctttcacatctccacaacaactgccatttggagtcaacccgaactttaatt 

ctgacaaactcatccctcctgttggccgtccatgtactatgtttccagatttacttcgtcgt 

tacatgtcgtacaaaatcaatggttcttgcccggatgatgagctcctggcacagaagctgct 

tctcaaaggttgtgagcctctccctcgccgcagatgccgtcctgctgctcaacaggagtatg 

ttgagccttatcctcttcctgagagcttgtgggctactccgtcagattcatctgtagtttgg 

acagcatatacatgcaaaagctgtgaatgtctaatcaacaggaaaaaaaatcagaaagcatt 

tgatgattgcaaagactgctttgatctcaatgggagagagaaaactcgttggtcgtcgaaaa 

agggagctggccttgacttctctattgatgaagtactagcagtgaagaaggctggtacaatc' 

agaatagggcttgacattggtggaggtgtggctacatttgctgtaagaatgagagaaagaaa 

cataacaatattaacaacttcaatgaatcttaatggtcctttcaatacatttatagcatcaa 

gaggagtcgtacctttgtacataagcatttcgcaacgacttcctttctttgacaacacactg 

gatatagttcactcaatgcatgtcttgagtaattggataccaacaacactactgcacttctt 

attattcgacatctatagggtgcttcgacctggtggactgttctggcttgaccatttcttct 

gtgttggtgagcaatttgagcaagtctatgctccccttattgacagcattgggtttaataag 

gttaagtgggtcattgggcgaaagatggacagaggccccgagctgaatgaaatgtatctttc 

agcactgttggagaagccactgaagaactcttggtgatagtattagatgtttcttttacttt 

cttacttttgatagttacagaagagaagacaggaggtgtttttattttttattttttatttt 

tttttaaattatagattcatttcagatggctcttcaggcataaactaacagtttgaggtagc 
gtgagtatgatcttgtaatgttggtgcggcta 



FIGURE 4 (continued! 
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SEQIDNO102 

ccacgcgtccgcaaaatccacaacaatctcaaattggattttctaatctgtaatattaatta 
cttcattcaaattatgtaaattcttttgtataaaaacccttaaaaacacaatcttttcatca 
attctcaattgggtttctcttcttaatctgtaagtttttgttacttcattcaattttgtata 
atggattctgatttttctcccgggtgtgggtcgggtatacaatcagactttgcgttcgcttt 
caatgatagcaatttctcggatcggatcttaaggattgaaattgtacccgacttgccggatt 
gtaaaacgggctgtgaaggttgtactggcggcattgatgattgggcccggaaccgcaagcgt 
aggagagaagacatcaagaaagaaaatgatgcggacgtggtcatgcaacgtgaggagcaagt 
agtaaattgtaatgtgcttgaaatggaagatggtcttgctgatgatgaacaagatgaagaag 
ctgtaggaatgcttgaggaatcaccctctggcattgagatgaccacaaatccccctggcgat 
gatgaagcttctaaaagcgatgatgattcatctacaaacatggactcttcaaccccccttcg 
ggtgagaactatacatatcagttctcccattttggcagctaagagtccattcttttataagt 
tgttctcaaatggcatgaaagagtcggaacaacggcatgtaaccatacgaatcaatgcgtcg 
gaagaagctgccctcatggacctcttgaattttatgtatagcaatactttatcaactacaac 
actcactgccgtgcttgatgtgttgatggctgctgacaaatttgaggttgcgtcatgcatga 
gatactgcagccacgtactgcggaatcttcgcatgacttgtgaatcagcattgctttatttg 
gatcttccttccagtgtactaatggctgatgcagttctgccgttaacagatgctgcaaaaca 
gtttcttgctgcacgtttcaaggatataaccaagttccaagaagaggtattgaatttgcctc 
ttgcgggaattgaggctgttctgtccagtgacgatcttcagattgcttcagaggatgctgtc 
tatgactttgcgttaaagtgggctcgcatccattacccaaagcttgaggaacggcgggaagt 
attgagctcacgtctttgtcgactcattcgatttccatgcatgacatgcaggaagctgaaga 
aagtcctaacatgcaatgattttgatcctgagcttgctacaaagcttgtcctcgaggctctt 
ttttataaggccgaagcaccatatcggcaacgctccattgctgcggatgcagggaatgcttt 
gtgccatcgttacatggagagggcatacaaatacagacctgttaaagttctcgagttcgaag 
cacctcgtcaacagtgtgttatttacctagatttgaaaaaagaagagtgtgctagcctcttt 
cctgctggtagagtttattcacaggctttccatttgggtggacagggatttttcctgtcagc 
tcattgcaacatggatcaacaaagtgcattccattgctttgggctgtttctgggcatgcaag 
agaagggggcagtgtcatttgcagtcgactacgagtttgcagttcgtaccaagccaaacgag 
caatacatgagcaaatacaaagggaactacactttcactggtggcaaggttgttggctacag 
gaacctgttgggtgtaggttggagcgcgtttttggctgacgatagtgcttacttcatcaatg 
gacttctccatcttcgagctgagcttactatcagccaatagagagtttaaatactatcgctg 
tgcttctgctgacagctaaactatactttttacttcagtgaggccttaagaagtttacattg 
tagtggcatcttacttgaaagtgcagcacatgtgagcaatagttgtatgggctatatattgc 
ttgttacctattggcatatatgcactggtgtaaattagtaaaatcagtctttgagcggttca 
tattttgacaatcacagtctttgttaagagttctagctgccc 
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SEQIDNO103 

tcaactgctttcgatcccgtggattcgtaaactcttggatgttttcctctgttoccaaa^r 

ttactgattatttcgataggagtgtgaagggtttggatgtttgtaacgcgataagggacgaa 
attgagcagatcaggcaatggcagaagcagatggagattgttttgtg?gLttgSqaa?ca 

UrttttJ 9 f ag ? attCtaat 9 ca ^tgttaaccatagaaacag g tcattcgggcga 
aacaatactcagaatgatcataggtctatggggcattttagatcgttatcgtggagJSatS 

9 CtaCtaatg 9 atta g c tttggctgtttttacaatgagttatgtgttg 

^L" atg ^ g L C !; g !^!"l g ! t . tgggCCggcCCaattt tgtgtcttcatgaaaggattt 



9Cg ^^"^t^^gaattgatcgatactgttcagttccJaatcacaga 

aaa^ 9 S! 99 ^ gga 5 c g cc ^gttagagaagtgtttcataggatcgttcggagc 

gg ?!?!!? 9 " g ^ ttgagaag9catctattatta g^catgagcataatatgatL 

agggagaa 

^;^" ata r atgtaaagaaattaggaaaaaa gg t ggatctttggtccatttttgS 
tcatcttgtttggaattgtataatcacatttctgagttaagttgctttttaaaaaLaaaaa 



aaag 

SEQIDNO104 
c 

a 



r 9 g CCgg ttttcttctcatccca ^tcgcactctagggttacgccgcctctatcagg 
aatcatgcctcgccgaagctctggaagatctgctcctcgtcctgcccctcgtgcggcccc? 
cgtcctgctccagctccagtacaccatgctcctccaccagctccLtgcaaag?^^ 

g t g ^?! ggCaCa r 9ggCtgtagatgCggtcatgggtccac g cacca ttcaaLcgaaL^ 
gttgcttccgaggtacctgctgcagcagcagctcctacaaccatcggtgctgggtctgatac 
ttgcagtatgcactctaaagcgttccaagactgcatcaatagctc^ggLgSacat^ 

S ? Cattttaataacttt ^actcattcttaatctgattgttgaaacagcga 

t gg aattatgacaaaaggcttggtggtattgatggagcaagtgaatttggttcttgat 
ttttgggtcaaataatttatgctgaaatatgaactttatagacttctta 



acac 



FIGURF 4 /c.nntiniiGctt 
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SEQIDNO105 

ccacgcgtccgcccacgcgtccgcccacgcgtccggccttggctctcactttcaaattcccg 
acctctttctagcgccgaaattaccagcacgcagagcaaggacaccattagccgttcggcca 
ctgacccaaatggaaatcggcggattagactccgatggccgtgaatttagaaacgcggacga 
gatgtggagagaagaagtaggagatggtgaccaccaaaagaagtctcaatggtataacaaag 
gcatcaattactgggaaggtgtggaagccacagtggatggtgtgctgggcggatatgggcat 
gtgaatgaggctgatataaaggcaagtgaggaatttctcaacaccattttgccagaaaggtt 
ccctgatgctggaagaggccgccatcttgtagctctggattgtggatctggcattggaaggg 
ttaccaagaatcttcttatacgatatttcaatgaggtcgacctactagagcctgtatcacat 
tttttggaatcagcccgggtaaatttggctcctgaaaatttaatggtgtcagagttgcacaa 
agctgccaatttttattgtgttccactccaggaatttactcctgatgctgaaagatatgatg 
ttatttgggttcagtggtgtattgggcatcttgcagatgatgactttatttcattcttcaag 
agagcacaggctggcttgaaacctggtggactttttgttctaaaagagaacattgcaaaaac 
aggatttgtattggacaaagaagataagagcatcacaagatcagattcatattttaaggagc 
tgttcaatcaatgtggactatacatctacaagatgaaggatcaaaaaggatttccagatgaa 
ttatttgctgtgaagatgtatgcattgactactgagatgccaaggcaaggtaataaacctag 
acctaaacggacaactaatagacctgctatcatcagatgatgaatatcacattggtgttgtg 
tggttttactaactttggatgaagtaattcataggttattgtttttaggtcacatgtatgcg 
agttctgtcaatgttatgttattgcttttggatataagttatatacattgatagtgaagaga 
tttgttgtgtactttagcttattgtaggttacttcttatgttgaattatttatgcaaccgct 
tttgtatcaatgtattctgctcttcttgtaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaccaatttaaaggtctgg 

SEQIDNO106 

tttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgt 
ccgcccacgcgtccgcaaggcttagggatgacgttgcccgataggaatgctaagaaaaaggg 
gaagcaaaaggcttccggtgagtcaaaagactcgcacgtagctgaagctcttgataagctta 
gggaacagactagagaggctgttaaaggattggaatcagtggccgggccgagacctggtgta 
gatagcttggggaatgatgcaatgatggaggagtgggttaagcagtttgaggagctttctgg 
atctcaggacatggagtcgatagtagagaccatgatgcaacagcttttgtcaaaggaaatcc 
ttcatgaacccatgaaagaaattgaagaaagatatcctaaatggttggaggacaacaaagct 
aagttgagcacggaagattatgaacgttacagacgccagtatgaacttataagagatctgaa 
caaagtttacgagactgaacctagcaacttcaacaaaattgtagagcttatgcagaaaatgc 
aagaatgtggccaaccgccaaatgatattgttcatgagcttgctccagactttgatatatca 
tctcttggacaactatccccagagatgttggagggccaacagaactgccgtgttatgtgaaa 
actgaaatgtccccccgcttgaatgtcctgcttgttttcgtcacctttgtcacagtttgcat 

acaacatttattttgct 
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SEQIDNO107 

ccacgcgtccgctatacctagatgacatttaccttagcccttaagccaaaaaagaaagaaaa 
gatccattgcctcatcctctgtaatctcatggattcactggtttcaatcctctgcatcttct 
tcttcttcaatataattcttaccccagttcatgctcaagtgatctttgaggatggttactca 
gttaagacactgattgatggccacaagatcaaaattaaccctcactccataatttctgtaat 
gggtgctggcaatttcatcattcttgattctgctgccagtactttttacaccttatctttca 
acaaaaactctgaattttctatttcgaagttaactggtagtgagactgctggctatgtggat 
ggttctctggataaggctaagttcaacaaacccaaaagctttgctgttgattcaaaagggaa 
tatttatgttgctgatatttggaacaagcatgcaattagaaagattagcaagtcaggtgtta 
ctacaatagcagggggttattcactaaagccaggccgtgctgatggacctggattaaatgcg 
tcattctcagctgattttgaactttcttttgttcctgagagatgcactttaatgatctctga 
ccgtggcactatgttagtgcggcaaatacagcttaaggccgaggattgttcaagagattctc 
attctgctctaagagcagtttctacatggttcttaaccgtggggcttccctgcttggtctgc 
ttgattctcgggttggtcatccgtccttatgttatccctaatgaacatggcagtcgtcttcg 
gcgcaacatgacatggaagcacttcctaatcagtctggagagacaagttctgatgttctgct 
tcggcatcagaagcgtagttgttgactcaaagatctattcacttttaaggaagctcgtgtta 
cttactttctcccatctgtgcctaatgtttagtcctaaagtagtagtatgccagacttctcg 
taaacaactggctcctctcttaagttttgacgactctgaaagcaaagaatcagcaaaatcac 
cggtggcagctaacattttggaggatttgataacttttgatggaagtttggttaactccgag 
ctgactactaatcaagatgatgcagtgagcaaaagtaccgatgtttctgttgtagatagcat 
gatactagctaatctaaaagggtttgcagaacaggggattgcttcttcagggcgtgaagttt 
catcgagcatttcgagcttagttaaccgaaaaaagaacgtaacttagtgaagtctagcagta 
gtatgtattactattaacttttgcaactgttctgaaagttcatcggtctatctgctaccact 
ttcatgtacatagtggaacaagcaaatgactcaaggcccttttgagttaatatttcctagcc 

tgtgttttcttggttccaaaaaaaaaaaaaa 
SEQIDNO108 

ccacgcgtccgcccacgcgtccgcccacgcgtccggaaaagaatcgcagtttcgaagctatc 
agaaaatcccaaacaacaaccatgtcttactacccaaaaggctaccacggggaagatgacga 
cggagctgaattcgacgagtacgatccaactccgtacggcggtggatacgacatcgctttga 
cttacggtcgtccgcttccaccctctgatgaaacctgttatcagacttcttcagcttctgat 
gaattcgactatgatcgtcctcagtactcttcttatgctgagccttctgcttatggtgagga 
ggctcttgagactgagtaccaaagctattctaggcccaaacctcggcccactccttcttatc 
atcgcccatctgaggaagaaggcgaagcttatgagcagcctcaggccgattatgggtttcag 
cctgggatgaatcgtcctggcagtggatatggtggggaaagtgaatacggatccgggtatgg 
acgcaagagcgagtatgaagaacccgcttccgaatacggatccgggtatgggagaaagaccg 
agtatgaagaacccaaaccggaatatgtagaacccgcttccgaatacggatctgggtatggg 
cgaaagagtgagtatgaagagcccaaaccagaatatggatccgggtatgggcgaaagagcga 
atatgaagagcccacatcagaatacggatctggatatgggagaaagagtgagtatgaagaac 
ccgctccagagtacggttcgggatacaggaggaagagtgaatatgaggagccaagatcggaa 
tacgggtcgggttatgagcgtaggaccgagtccgaagagtatggatctggtggatatggaag 
gaagcccagctacgggcaggaggaagagggggagaggaggcccagttatgggcgttcaagct 
accagactgaggagggagaagggtacgagaggcctcgctatggaaggtctgaggaggaggac 
tacaggaagcctagctatgagaggcgtggtgatgacgacgacgagggctatggtcgcaagaa 
atatggtgatgacaactccgatgatgacgaggagaagaaacatcaccacaagcaccaccacc 
gcaaacactatgatgattgagcagtgtgctttaatcatctgaaccagatttatgccatacta 
agaactattacaaaataaaagttggcaagtttgagatacattttgtttgtgaatgtttgcta 
tgatggctggactgtccagttatttatgtgatgtattttgctcttctgcaaatcccagacat 
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ttgtcagggttagtatgccatgaatgtgtgaactttatgatcatgatgactcttttatctct 
taaaaaaaaa 

SEQIDNO109 

ccacgcgtccgcccacgcgtccgcccacgcgtccgcccgagccagacgttaaacgacgtcac 
tttaatgtaccctttccccaaaaattggggctttgtaaattcatttgaacaattcacaaatt 
gtagaatttagggttctttttcagtaatggaaagtgggatttgtagtcccacaagatgaaaa 
gcagtcaagaagaagcgagccgcggcggcgcagcagatgaagaaagtgtgaaatttcacagt 
gttgtacagccacttagggatttagaatccaactggggtgttgatttagccaaaaatcttga 
agaatatttgctcaaaatttgctctggtgaaattactagagataattatgatgatggtcatg 
tgaattttgctgaagctgcattgctgcttcaggggtcagttcaggtgtacagcaggaaggtg 
gaatatctgtattctctggtattgcattgtttggaattcattaccaagaagagtgaaccaga 
tctaccagcaagtgtatcagcccaagaagatgaaaacggtttgcctgctgccgacaatgaag 
agaatgatccatactgggcttcagaagaaacctcagtggaagcaaagaacatgttggataat 
acgacgtgcagggattcttcatttacccagtttgtgaaggcccctgcaaatctggttgtacg 
cgaggctgactgcttggatgttactggagatgctggagaactagagtcttacctgctagcca 
cgtgtgatctttaccgagattttattctgttggacgcatgtgatgccgtaacagtggatgag 
tttctgaataatgagaatatagctggaaaggtgctgaacaatagctgcagtgcagagggcct 
ttctttggactccaagtgccacaagagcttttactctcccacaagacgttttgagggaactg 
gcaataagtcttcagctcaaaagaatcaggatgctaatttatatcagtctcaagggtttcat 
gagtttggtccaggcaattttaacaatgatcagttcgcatctgatatgcctgattacatcga 
tgatgcacatagatgtgaagatggatattcagaacctagagactcagacgaatcggatgatg 
aagacccatggaatccgttgaacccgcatgaacctggcactttgaaagtaaaaccatacaaa 
aaagttaaatttaatagaaggcagggtgcggcgtccaaaaaagttgcatctttggctacaga 
atttccagttgcgagattacatggtaccactagcgcagacctcaacgacatgtgggagagaa 
aatgttgtgccatgaaaaaacaaggcgactcacaatctcctccaccatatgagaagctccgg 
gaatcacttcttcatggggagaacaacgattatgatggtttggatagtccaaaggaaaagaa 
tgaaaatgatgactatgatagtgcagatcacgattttgggccttctgcctttgacatgccag 
aaaatgctgacatgaacaccgatgcaactccttatggggaaaagcacgataaatgtagtcca 
ttttttgacagtgaagctcatgaagattcgaatgctcatgccaaccttgaagatctttgtcg 
ctcccacttggattctcttcttgctagccttgctgaaactgaaaagcagagtgaattggctg 
cgggtttcaacgtggaaacagagaattgagcagaacttggaggaacaagaatcacatccc 
.ctttgacattcatgaatatggggctagggttttgtgcaagttatccctggaagaaaatgg 
tcaaagcaccaagtctttttctgatgttgtcacgggtcaagagaagcatgatattgctcgaa 
cattttctgcgcttctgcaattggtaaacaacggagatgttggtttggaaagaggtggaata 
cgtgagtccacttgttacacagctgcaaatcccttctatgtccggctccttaggaatgataa 
tggtagggagaaaatgcagattcggtcatcaagaaagagagcaaaatctccaatacccaatc 
agggctttagaaaggaaaaaaacaaaggtaaagaagttcaggctgctttcagttcatcacct 
tcagaacccaactcaaggttaccgatttgccctgaagctgggaaaggttaatggaactcgtt 
gtacgcctgaaggtaagaaaagaaggaaatccagattagtcgtaccaccagatatacatact 
gcattgtgatatacattttgctctagttttcaagtaagcctctcctctctcgctattcggtc 
tcactgtgcccgttgtatgtgagagactcaaggcagtaattctgtttgagtgtagtaagaca 
gaagattaaccccaccatgaccactgtaattcttatcacaaaccaacaacctgttggctgca 
gaaatttgtaagatgtgtttattcttaacttaattaggacttactaatagtttggagcaggg 

aggatgtaacaatattttgacatagtgcagagctactcatcatagctc 



ca 
ccc 
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SEQIDNO110 

ccacgcgtccgcccacgcgtccgccaaaatccatcacgaattgcattttcagatacgtgagt 

caactgctaatgggagaacacttggctctatgtgttgatcgtcttatcacacctaaatcttt 

gcactcgttgcaagggtcagaggatgcaggatcctctgcaggaagttcttgctcgcacacag 

taggtcaatcaccttatggtactactaataaggaggatgaagaactagaagctggaggtgaa 

gatgagccattacttcagactgtggaatgccgaatttgccaggaagaagatagcactaagaa 

tttggagattccttgtggctgcagtggcagcttaaagtatgctcataggaaatgtgttcagc 

gttggtgcaatgagaagggtgatataatttgcgagatttgtcatcagtcttatcaacctggc 

tatactgctccaccaccgccttctccttctgaagatattgccatcgacatcagtgggggctg 

gacagtggctggtactcagcttgacttgcatgatccgcggcttcttgcgatggcagctgcag 

agcgccatctcttggaggctgactatgacgagtatgctgattcaagtgctagtggagctgca 

ttttgtcgttctgctgctttaattttaatggcccttctattattgaggcatgctgtgaccat 

cggaaatgatgatggagatgacggtgatgtctccacctttttctctcttttcttgctccgtg 

ctgctggttttcttctaccttgctacatcatggcttgggctatcagtatcatgcagcgtcga 

aggcaaagacaggaggcaacagcacttgcggcaacagaagttgctttcatgctgcaggcagg 

gcaacataggggcttgcatgtaacaatagcaccaggacctgcacagttagctgaaccttcag 

caacaccagcacacccaactactcatgttgcaacaccaaccgcccaggcgacatcccctcct 

ccagagatggtataaatggctttgctcagtttgcttgttataaaatagttgccgataggggc 

attttactgttggtaagttgcacaagatggggatgagtagaagggtagaggagtattctttt 

ccctttttgctttttcgatttattagctgtatctttgcattgccaaatttggagtgcagagg 

ctgaaactttttccatttgttcaatttttcattaatgcttgaacatgtaaaaatataatagc 

gaacttagctgctttcaatgtggagataccatatcttcacatcgtgtacattgtttatatat 
taccattatgggttactcttaaaaaaaaaaaaaa 

SEQIDNOlll 

ccacgcgtccgatgagacttggagggtgtcctctggtttagctgaggcatggcgagacaaca 

caaatgttgcatccaagaaaaagtcattttccattgaaactgaaattgatgatgaggcgact 

agttatgcgtctttgaacgaggacggtcatgactttgatgagattgaggatatgaggatacg 

cgggaacttgttttacaagcttgataaagattccaaggaatacgaagaatataagtttgaat 

tccatagaaggaatacgaacaagaataatggaaatgacggtccaaaagagaaggaaaaatcg 

aataacgtttcagcttctagggtcgagaaaggtctaaagggtatagatgagaagcagcaaaa 

caagaaagagaaactgagctataactctgcctctccgtttcagaattttcagctaaatgatt 

tcggagcatctccaataaagaggttaagggttccaacttttaatcagcttactgccccttat 

catgagccgttttgtttggatatttatgtgtcgaaaggttcagtaagtgctagcattatcca 

cagagctactagcaaggttgttgttgtggcgcactctatttcaaaggacatgaaatttgact 

tgggatcaactaagaatagagctacttgtgctgctattggggaagttctggctcaaagagca 

ctggctgatgatattcataacgtagtttatacgccaaggaaaggggagaaattggaagggaa 

acttgagattgtacttcagtccattattaacaatggcatcaatgtgaaggtgaagattaagc 

agaggaaaaccaagaaacctggcttccaccgcccgacagcttaggtggtcatcctacattac 

gtaggatgaaattaaaagtgacaaggaagttttatcaacgtcttataagctcgaaacagcgc 

aatgtagtaagtagaacaaggtcagagatgtattactacctcttttgcgaggttgcagaaca 

tttccctaaattcagtctttaaatcggtttcaatagtagttacaaacttgggaataaatctt 

ttatttcctgcaatttgtattctctttatgagaatacattgctgttaatgtaaaagtgtgac 
tcgcag 
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SEQIDN0112 

ccacgcgtccgcccacgcgtccggctggaactttgctgtatcatcttcaaactcttgattag 
ttatattaaagactagtctttaaactcaatgggtgatagtcagtactctttctcactcacca 
ctttcagcccatctggaaagctggttcagattgaacatgcattgactgctgttggatctggt 
caaacttcattagggattaaagctgctaatggtgttgtaattgctactgagaagaagttacc 
atccatcttagttgatgaagcatctgtgcagaaaatacaggttttgacgcctaatattggag 
ttgtctacagtgggatgggccctgattctcgagttttggttcggaaaagtagaaagcaggct 
gagcaatatcaccgactctataaagaaccaatccctgtcacacagctggtgagggaaactgc 
tgctgtcatgcaggaattcacccaatcaggtggtgtaaggccatttggtgtttcactcttgg 
ttgcgggatttgatgacaagggtccccaactatatcaggtggatccatctggttcatacttc 
tcttggaaggcttcagctatggggaagaatgtctctaatgcgaagacatttctcgagaagag 
gtacacggaggatatagagctcgatgatgctgtacacactgctatactgactctaaaggagg 
gattcgagggacagatctctggcaaaaacattgagattggcattattggaactgacaaagta 
tttaaagttctcacgccagcggaaatagatgattacctacaagaagtggaatagattttctt 
ttccgcttaaggcattggaaaaagttgtcaggttggaagcgcagacggggtcatagcacaac 
tattggatgttcttgttggcttgattatcacttgactttaatcaaactagacttagttgtat 
gttggccatgttgtggttatttattgcctgatgtatggctctgaaaagttatatgggttttc 
ttttctcagtttcttgaacatactgattgttctatgttacctgaaacacatgacagtagaga 
aaagcattatattatttgagcaaccctcttacgtctgagaacgg 

SEQIDN0113 

cgtccgaaatggcggaagacaagaaagagtcaacgtcgagttcgccgctccaagaagatccc 
gaagatcccgtcaaatcccctccttcttcccccaattcctccactcgcaaggcttgctatgc 
tgttcttcaaagttgggtgtcaaagaagttcatgactggatgcgtggtcctcttccccgtgg 
ctgttacatttttcatcacttggtggtttattcaatttgttgatggtttcttcagccccata 
tatgaaagacttggtattgacatatttggccttggatttgtgacatcgataaccttcatatt 
ctttgtcggtatttttgcttcatcatggctgggttcaacagttttttggataggggaatggt 
ttataaagagaatgccctttgttaagcatatatactctgcatccaagcaaattagttctgct 
atttcaccagaccagaatactaatgcattcaaggaagttgctataattcgtcatccccgaat 
tggtgaatatgcgattggtttcataacatcttcagttgttctccagagagatgatggggatg 
aagagttgtgcagcatttttgtccctacaaatcatttgtatataggagatgtatttctggtt 
aattcaaatgatatcatcaggccaaatttgtctgtgcgagaaggcatagagatcattgtttc 
tgtgggaatgtcaatgccgcaggtgatttctcctatagaaaggatcacacgacagaccgacc 
ggatccctctaaacagaatgttaaagtaaacagaatcatcatctcatttgcttctggtttgc 
gctaagctaccataatctcatttttttagggaagtcgcatatgtatactgttggtcttctat 
gttcatttgatggttcagcagatctgaactggagcaattagcaacattggtgactttgttgt 
gtgtttattctttaggattagtaggaggagttctgtttgtcggaaacaaataggtagggagg 
cattgtttggctgtagctggtttactctaattaacatttcaccgtggtctgtacagtcttgt 
aacttatgagttcttgtgtttgtattataaagaggctatcagtgttatcgc 
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SEQIDN0114 

ccacgcgtccgcccacgcgtccgcccacgcgtccgcccacgcgtccggcagcctacagtcca 

tattcacgtgctacatcccctgccccaactttggggcatgatggccagctctatggatcaca 

acaataccactatccgtatttccagcccctccctccaaccagtaattcgtacactactccag 

ttgccctgccaaaaggtgagattgccacctctgctgctgctgctgaccatgcatcgttgtct 

gttgattctgctaatggaatttctaatggcattgccaatggtggtgtaaagggaaatgctgg 

gcctacgcttgtgaggcctgcattccagaacccatccgtaaatgctaatggttcttatggac 

ggggtgcgttgcctggaggagctgcttcaggttatcacgaccctagattaggttttgatggt 

gtgcgatctcccattccatggatagatggatcaatgttcactgacgggcaaggtaggctagt 

gtcgagcaattcttttacaccatctttttcaaatggcagtgccgttccatcatcaaaaaatc 

agaatgttcatccgcatctaatgggcttccaccacccaaggccctcttctggcatgaacaca 

acaaatgggtatatgaataggatgtaccccaataaactgtatggggggcggtattgtaacac 

attcggtactggcatgggctttggatccaatggatatgatacccgtaccacaggtcgtgggt 

ggatgacggttgacaacaagttcaaacccaggggtagaggaaatagtttctacggtaatgag 

aacatggatggtttaaatgagctcaacaggggacctagaggtaaaggtttcaagaatcaaaa 

gggttttacaccagtaacgctggcagtcaaggggcagaacgttccgctcaccctaaccaatg 

atgctgagaaagaaaaaccaagcctgattcctgacagagaacaatacaactgtccagatttt 

ccagtgacatatactgatgccaagttttttataatcaagtcttacagtgaggatgatgtgca 

caaaagcatcaaatataatgtttgggctagcacaccaaatggtaacaagaagcttgattctg 

cttaccaggaggctaaacaaaagtctggtggttgccctgtttttcttttcttctcggtgaat 

acaagtggtcagtttgtcggtgttgcagagatggtaggaccagttgatttcaacaagagttt 

ggagtattggcagcaagacaagtggatcggctgctttcctgtaaagtggcacatcgtgaagg 

atgtaccaaacagcttgttgaaaaacatcacgctggagaacaacgaaaacaagcctgttacc 

aacagtagagatactcaggaggtcaaaatagagcagggcctacaggtgattaagatatttaa 

ggatcatattagcaaacagtgcatccttgatgattttgagttctatgaggatcgtcagaaga 

gaattcaggaaaagaaggtcaagcagcagctattccagaagcagtcgcaggtatgggaaggc 

aaagctactgaagagaagaagaaagaaaacacgaaagtggaacctaagtcccagaaaccttc 

agaagttcctgctggtttgaacaaggaaagtttacccgctgctccgactaatggggaggtga 

agcttacagaaaatggatcagttacaaagggagatgatatgaagggtgctaaaccagtcact 

gtagcggaaaagaaacctgtagctatagggatagcaaaaggagttgctaatggatgctagct 

tcacctaatgaagggggaggtctgtggttaaagaagccctaaattggagcttgttgactaca 

tgatatgcacgccagtgcttggttagatctcataaccattggactgccccttttatcctagc . 

tgcatttggagttggttcttgcattaagaaatccccggagataaatcaatagtggcaaggct 

agttcaatctgtttctaagagttcaggaagtatggaagctccattttccctcaggttttagc 

ttctgacaggtttcataccttggtttgggttttaggataatttttttttataattttgtttt 

cgtcatgtggcttattttggtcaattttcccctttttttaaaagttatttgggttttaaagg 

gtggggttcttgttattattagtttggctcccaatcctatcttgtaaatctagatcaatctg 

ttgcggcagttcccaacattgcttttttgtactaatgattgagctagaagctagttttaaat 

gtcaagtctctaccgg 
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SEQIDN0115 ' 

ccacgcgtccgcccacgcgtccggcgggatttgtgagtatttgaatgaggaaaaagggtgtc 

cagtgtgatgagatatgagggttaagattgtaggaaaatgggttgtgtatttgggaaagaga 

tttcatcttctgagacacctaatggggaggttgtagttggaagtaggagagaaaatggggta 

gatagagatttggctgccccatctgggaggagagagaaagttggtactgtaaataaagtgga 

tgccgtaggtggcggcggaggaggtagtgatgttggtgaagttgtgaatggtagggatcaga 

aggatgagaagaagggtgagaatgcaaggcataggggtgagaggagaaggtctaagcctaat 

ccaaggctaagcaacccacctaagaatgtgcacggcgagcaagtggcggctggatggccgtc 

atggctttctgatgtagctggagaggctatcaatggttggattccgcgcagggcggatgcat 

ttgagaagctagctaagattgggcaaggtacttatagcaatgtctataaagctagagataat 

ctaacggggaacatcgttgcactgaagaaggttagatttgataatttggagccagagagtgt 

gagatttatggcaagagagatcttgattttgcgccgcttggatcatccaaatgttattacgt 

tgcaaggattggttacgtcaaggatgtcttgtagtttgtacctcgtgtttgattatatggat 

catgatttagctggacttgctgcaagccctggaatcaagttcacagaggctcaggttaaatg 

ttacatgcatcaactattagcagggctcgaacactgtcataaccgtcttgtgttgcatcgcg 

atataaaaggatcaaatcttcttattgacagtgggggagcactcaagattgctgattttggg 

ttggcttctttctttgatcccaataaaaagcagcccatgactagtcgtgtggttactctatg 

gtacagaccaccagagcttctacttggagccaccgactatggtgttggtgttgatctttgga 

gtgctggttgtattttagctgagctattagctgggaaacccattatgcctggtcgtacagag 

gttgaacagctccacaagatcttcaagctatgcggatctccgtcagaagaatattggataaa 

gtcaaggcttccacatgcaactatattcaagcctcagcaatcatacagaagatgtatagcag 

aaacttttaaagattttccgccttcatcgttgccattgattgagactcttctggccattgat 

cctgctgagcgtcagacagctacaactgcattacagagtgcattctttactaccaaacctta 

cgcctgtgaaccttccagccttcccaaatatccacccagcaaagaaatggatgcaaaacggc 

gagacgaagaatctcgaagacaaagagctactgggaaagctaatgctgatggtgtaaggaga 

aatcgtcaccgtgatcgagcagtgagggcaatccctgctcctgaagccaatgcggagctgca 

agtcaatatcgataggcggcgtctagtaacacaagcaaacgcgaagagcaagagtgaaaagt 

ttcctcccccacaccaggatggaacattaggttataccttgggttcttcacatcacattgat 

ccagcctatgaaccctcagaagttccattcacatccattaatttctcatattcaaaagaacc 

gatccaaacgtggtccggcccattggtggaacctgcaactggtgctccaagaagaaagacaa 

agccatcaaagaaggattctaacaagaaaggaaaagaaagcctgtaaagtctataatgaacg 

acgtgattctacaatggtatacttcaaggagctgcaaaacttacagattatttgtcctatac 

gtaaatcaagaagcttctcaacagcatagagaggtaaacaagcatttttatcgtagtattct 

cctttgtattcttttggataatgagaatcttttcattattgtacatgtaaattttgtttctt 

catattagcaggctctgtttagatcaataaaatcgtaacgctc 



r 



WO 03/085115 



71/140 



PCT/EP03/03703 



SEQIDN0116 

ccacgcgtccgccgttttccaacttccaatgcgcggcaaaccctaatcctcagctttggttt 

ttgcctcagaaaattcatccgtcaatttgacctctattatggggcgcagtgactctagatca 

cctgccaggggtcgtggatctcctcgtaagaggagcccttcacgcagggaaaggtcacctgc 

tcggaaaaagagttcacatgctgcaagttcagctgtagcagagaagccttcaaaccgtaata 

ggtccccgagacgtgcaaggtcaagatctcttgttcctctttcacctgcaacagagaggcca 

tctagtcgcaataggtccccaaagcgcagaaaatcaatctcccctgcatctcactccccagt 

cagagagaaaccctcgagtcgcacgaagtctcccaaacgagctaagtcaaggtctcctgatt 

cgaggttgttacaggtagagaagtcttcaggccgagtcaggtctcctagacgtgccaagttg 

cagtctcctgaatctcgctcaccctcaccacgaacaaaaagactaaggagagcagaacaaga 

gactgaagaaaagacaagggggcgcgagcctgagaaaaaccatgggagagctagtggtaggg 

ctgctctacatagggagaaggattctgatagaacagtgcctgaatcccgttcaccgt caeca 

cgaacaaaaagactaaggagagcagaacgagagactgaagaaaactcgagggagcgagagcc 

tgagaaaaatcatgggagagctagtgatagggctacacatagggaaaaggattatgacagaa 

cggtgcttgagtcccgttcaccgtcaccacgaactaaaagactaaggagagcagaaccagag 

actgaagaaaagttgaagatacgggagcccgagagaaatcatggaagagctagtgatagggc 

tacacataaggaaaaagattctgacagaatggtgcaaaatgaaaggagagagaaaagatcag 

gaaaggatgcactggataatggatcttctaagtcaagaaatggtcgatcagcttcaccttca 

gaacgtcagcataggagtcggcacagatcgagatcacctgcagcagcggacacgagagcacg 

cgatgagatgacaagctcaaggagaggtgaactcaggaatggtgatgatgactccttatcta 

aaatgcaggcggcagaggaggccttgcaagctaaaaataaagacaagccttcgtttgagctc 

tctggaaagcttgcagcagaaactaatcgagtaagaggtataacacttctctttaatgagcc 

accagatgctagaaaacccgacgtacgatggcgcttgtatgtttttaagggtggtgaagtcc 

ttaatgagcctctatatgttcatcgccaaagttgttatctttttgggagagaaaggagggtt 

gcagacattcctacggatcacccatcttgcagcaagcaacatgctgtcctccagtacaggca 

agttgagaaagacaatcccgatggtacttcatcgaagcaagtaaggccgtacgtaatggatc 

ttggaagcactaatggtactttcattaatgaaaatcggattgagccccagagatactatgag 

ctattagaaaaggatacacttaagtttggtaatagtagccgagagtatgtgctgcttcacga 

gaattcagcatgatgagtctctaaaatggttgacggaggtgtcatttgcattgattggcttt 

gacgtcagaagctttatcagatcaaatatttgctgtgccatgttactagcaggatagccgtt 

gtaagtgcttagccgaaatcgtgtaatgtggtagagatttgggcattgcttgcaaagttttt 

cactgctaatgaaaattttggtttatgcatcagtgatttatcctccagtttgtttataagct 

ctttgtcccctatatatgggatatgttattgttgattaggtcttaacttgtgaatgtgcgct 
cttttcttctaattattgaagatgctggagtgccccc 
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SEQIDN0117 

attagatgcgtttggctttgactcaagcccttgaatcaaaactgcaatcaccgttgtggcag 

gttcgtgtgaaagcaatctgtgtcctcgaggctatcttgaggaaaaaagatgacgagcactt 

tggtattatggcatcttatttcaatgaaaataaagatgttgtgg'tgaaatgctttgaatctc 

cccaagcgtcattaagagaaaaggcaaacaaagtcttaagtcttctgaatgatggacaaaca 

gctgattctgtgcctcatgtagataggtcagcaatggctggtgcccctgttgttcagatgcc 

cgacttgatagacacaggtaattccgatgatctgtttggagcagatgatttagcaaatatgc 

agagtggtgaagggataaaaattgcatccacctctggcgcccctctggttgatgatctattt 

ggagacaatttgggtggcggcgtggcttccggccagcagaaaaatggtgatgac.ccctttgc 

tgatgtctcatttcacaccagtaatgagaaggcgcttgaagctgatcacttttctggaatga 

catttgataaaacagatgctactgaagtccatttggctgtcgatagaactggacctgaactg 

tttgacatgtttggtcccagtgttgaagttccccaggatcccaataatcctagaaaggagat 

tcacgatttaatgaatagtctctctttgaatgggaatgactcatctaagaagcagaatggca 

gctcaaggggaacctatccggatatgtttcaagagtccactattgatcctcatcaggcttcg 

aatgatgccttgaacagcatattttcctcccaggctggtggagcaaattcaaatcccatgtt 

tcctttgggtgctatgcagtataacttgcctcctggcttcgtattgaatccatcatttgctc 

ctcaggctctaaactataatgccatgggtaacatgtttgctcaacagcagttctttgcaaca 

ctttccagttaccagcaattagctaccatgcacccatccactagtgctagtcatgccgctga 

ttctgctggaggttatggttcagctcttcccgatatcttcaatcctagcatttctaatcata 

gtcctacttccttgatgaatacttcaaagaaagaagatacaaaagcatttgattttatctcg 

gtaagtttggtccttgtgtacttcaatttttctattattactttgaagatgcattgaattgt 

gacggccctagtgtgtccctgattttgaaggtatgccattaacaattatgccttttgtttat 

ctattttatttggcctaaatccctccttcctccactccaaaaagatgaggtcctccgccttt 

attcttgtggtgataaatgaggtccattgcattgtccttttccaggcccccagtcatgtaaa 

gaataagtggacttggaaatactctggcatcataatcagctattccttttctgttaatgtac 

ttagatatctcattgtagggcttctcagtgcttcatctttttttgtcaatgttgtgagcaat 

aaagtttctcagttctgattgtgtgcaatatcatcttttccaaactgagaagactagaaaac 

ttcatttaggactgtatgactctaattttgttgccatggtggattccctgtgttttttgcag 

gatcatttggctgcagctcgtgatccaaagagggtgatttgagtggttatagctcaaagcaa 

cccagagtatgctgcttataagatttagctatgcacaatttgaagcaggagtaatctgtaaa 

ggttcttttgaagcagtgatatcaatgtgaaatacagtattatttttttttt 
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SEQIDN0118 

ccacgcgtccgaacgattctccctcgtaacttcattttcagtcatggcttctgctactaaga 

aggtattggttccgattgcgaacggaaccgagccgattgaggctatagtgcccatagatatt 

ttgcggagagctggtgcggagg'ttactgttgcctcagttgagaagcagcttcagattgaagg 

agtgcacggaattaagatcgttgctgatgctctaatttctgattgtgcggatactgaattcg 

accttatctcacttccgggagggattcctggtgcaaccaacctaaggaattgcaagactttg 

gaaagcatagtaaaaaagcaagctgaaaatggacggttttatgctgcaatatgcgctgctcc 

tgctgtagcacttggatcatgggggcttctgaaggggctgaaagcaacatgttatccgtcgt 

atatggaggaactatcatcttataccattgctgttgagtcaagagtccaaaaggatggaaca 

gttgtgacaagtcgaggaccaggaactgcgatcgagtatgctgttgcattggttgaggagtt 

gtatggaaaagagaaggctgatgaagtttctggcccactcgtgatgcgcccaaatcacatcg 

aagaatttgcatttgctgagctcaattcagtaaattggacatttacgagtaagccacagatt 

cttgtacctattgcgaatggttctgaggaaattgaagcagctactattatcgatgtacttcg 

acgagcaaatgctcaagtagtagtggcatctttggaagatacattggagattgtcgcttcca 

gaaaagttaagctagtagcagatgtgctccttgatgaagctgctaagcagtcatatgatctt 

atcgtcctgccgggtggtcttggcggtgcccaagcatttgccaactcagaaaagttggttga 

catgctgaagaagcagagagaatcaagcaaaccgtatggagcaatgtgtgcatctccggctc 

tagtcctagagcaccatgggttgctcaagggtaaaaaggctactgcctttccagctatgtgc 

aataagctctcagatccaagcgaagcagaaaatagggtgttggttgatggcaatcttgttac 

tagcagaggaccaggaactaccatggagtttgcactggccattgcagataagtttattggcc 

gcaaggaagtactagtgctagcaaagaagatggttttctaagtagaatcattttgcctgtct 

tccgtctttaggattatataggcacccatagttacccatagttgtaaacttgtaataaactt 

tggccatcagtgtgcacttaaaataaaagaaatgctgtatacgagttacactcagtcgcatt 

tgctaatttcctattcaatgccatcgctttttaaaaaaaaaaaaaaa 
Group 3 

SEQIDN0119 

GNATGNCTGACCNAGNTGNGTGCTTAAGTNTCGCANGCNNCTGTAGTGNAGGGGACCNNNCA 
NTNTCTNCTNGACGNCCGCAGTAACCAGNNCTCTN7^ACCNATGCATNATGCAGATNCAGGCT 
TTNCAGTCTNTTANGGCTCAATTGGTGTATGCAAGNTCCAGGACATGGTGTACGATCAGATN 
ATGATCTGCAAGCGAAGAATTNTGNTTTNTCAAATGCTCTTTCAAGCCCTGTTCGACGAAGC 

CTGCAGAACTATCAAATTGCTCANGGAGGTTTCCTT 
SEQIDNO120 

ACTCGCGGACGGAGAACAACCGAGAGAANGGGAGACATNGNTCCAAANTCGTGGACTNCCAA 
TGTACTGNTGAGCACTNNGTAACTNATNATNTGGNTNATGAGGGCNNGGCANGAATAGGCAG 
NACGCGGGNGCAAACCCTGCGAATTGATACGATCAGATAAAAGATCATCANNATGGANAGGN 

GCNGNTGTTCTGGGGT 
SEQIDN0121 

TNTGGCTAAGAACTGNNCTATANCTAGNGACANTGTGCTATTCGACTCACAGGAAAAACTAC 
TAAAGGATTCTCTAGATTNGCATAATTTCCAAGAGATAAAAAAGTTTTTCCATGCTCACTTC 
AAGGTGGATCGAGAACTTCAGGCATCTGTTGCTGTGTATGCATTGAAAGGCCAGNGATTCTG 

TTCTTGATGAGCAAAATAGAATCTACAAGTTGGTGT 
SEQIDN0122 
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GGNGAGGNGACGGNGANTGGAGCNGTAGTGTCGCGGGAGGAGGGACAAAAGCTGNANGNAAG 
AACAGCGNNACAAGANTNACACCTNCTGNAATATANT 

SEQIDN0123 

AATAGCAGCGGCAGCATACGACTACGAGAACGATCCGAGATGGGCAGATTACTGGTCCAACA 
TNCTNATTCCTCCTCACATGGCTTGCCGTTCCGACGTNGNNNACCACT 

SEQIDN0124 

TTNAGNATCCGNAAGTTGAGCAACAACTTCAGCACTCGNGCAGAGAATGGAAANTCGATGAC 
ATTGTGNACGCTNGNGGCAGTGGGGGTACGGACGCTGGATTGTCCATTGCATCCAGGCTCAG 
NGGC 

SEQIDN0125 

ACTCCTTACAACCAGCAGATTGCAAATTGCTGCAAGGGAGGAGTGATCAACTCATGGGGNCA 
AGNATACTGCAACATGCTGTTAGCTCATTCCAAGTCAGTGTNGGTGCTGCCGGAACAACCAA 
TAAAACAGTTAGAGTTCCTAAGAACTTCACC 

SEQIDN012 6 

GCCAGTCATTCGTTGTCCCATAAGCCCAACAGCCACCCAAAACCATCCAAATCTGAGCTGTT 
NTCCAGCGCTAAGCTAGTGGCCGACGCGGCCAAAGCAAAACTCCATCACGAGCCAAACAGTA 
AGGTCGACAAGTCTGAGCTCGCCGGAGCCGCCGCTGACCTCC 

SEQIDN0127 

AGAGNCAAATCCCACATGGGGCATCTGGCTTGGGCGGATGCTTTTGTCATCACAGCAGATTC 
ATGTNTAGACATGTTGAGTGAGGCTTGCAGTACTGGGAAGCCTGTGTATGTAGNTGGAGCTG 
AACGCTGTACGTGGAAGCTCACAGATTTCCACAAGACACTCAGAGAGAGGGGACTGACTAGG 
CCATTCACAGGACTTGAGGATATGTCAGAAAGTTGGAGTTACCCTCCGC 

SEQIDN0128 

CAACAAGAGGAAANTGGAGTTCAATTGACTTGGGAACTGAAATTTATAGAGACGCTAATCAA 
GTGGCTGAGTGGACTGTCTCTGATTTTGACATTCTTGTACCCAATAACAATTAGAAGTA 

SEQIDN0129 

CTTTTGCTAGAATCTTGCAAGCTGGACAAGGTGAAATGGTCGGGAGTGAAAACACAGCATTC 
AACAACCCCGTTCGTTGATGAAATGTACGAGCGCCTGAAAGAAACTCTAACTGACTATGAGG 
TCATCATCTGCCGTTGGCCGGAGTACACATTTGCATTGGAGAATGCCATTGCTGATATTGAG 
AAAGCAATTTTGGATGCACTAGAGAAGCAATATGCAGATGTCTTGTCACCA 

SEQIDNO130 

CAGAAAAGGAGGAAAAAATGAGAAAATATCTTCTGCTTAGAGTGTTGTCAAAGCTTTTGCCC 
TCACTGCCTTCCTTCTCATCATTTTTGTCCTCTTCTCTTGGTCTCTCTCTGTATAATTATGT 
AGTAGATAAAACTTCAAGTATTCATTTGAGGTTTTTGTTTCCTAA 

SEQIDN0131 

CTCTNCCTCACACGAANNAGTACTTGACAAGGGAGTTAGTACTTTATANNGACGACANTTTA 
GNCN 

SEQIDN0132 

NTNCCATGTTNANAAAATNCAAGCTCTGAATGGAAACGGCTTGGGTGCTGATACTTCTTCCT 
TTGGTTTCTTGGGACAGATTCCTCGAAACTTCAGNTTGTCGGAC 
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SEQIDN0133 

GTAGNATTCTCCATTTGAAAATACATAGTGTCAGGCCATCTGGGACATCAAACGGAGGTGGA 

GAGACTAGTCTAAGGGCAAGGAGACCACCAAGCCAAGATCAGGA'TGCTGCATTAGCATTGCG 

ATTGCAGTATAGGAATTCTCTTGCTCTGGCCAGATCGAATTTGAGGGCCATGGCATCAAGAG 
CCA 

SEQIDN0134 

NNNNGNATNTNTCTGNTATAATCTTGCAAGNTGNACAAGGTGAAATGGTCGGGAGTGAAAAC 
ACAGCATTCANCAACCCCGTTCGTTGATGAAATGNACGAGCGCCTGAAAGAAACTCTAACTG 
ACTATGAGGTCATCATCTGCCGNTGGCCGGAGTACACATTTGCATTGGAGAATGCCATTGCT 
GATATTGAGAAAGCAATTTTGGATGCACTAGAGAAGCAATATGCAGATGTCTTGTCACCA 

SEQIDN0135 

TCTGGGTATCATTTGGGCTGTTCGGTACAAGACGGATACGGAGAGTCACTTANGAGAAACTG 

TTNGGAAGAGAGAGAAAGAGGCGGGAAACTATTGGCNAGGTGTGTGGAGGAATTGAAGAGAA 

AAGGGGTGGAGTTTGATTTGTTGAAAGAGGTTGNCGCTCTTAGGAGGGCTANNAGTTNGAGG 
GTTGAAACTAAGGTTG 

SEQIDN0136 

GCACGTTTGGCTCGTCTGCTCTCCCGCAAGAAAGTGCGAGTGATATGGATATCAGTAGTTCA 

GTACTTTTGTTAGGGTCAGCTTGTTGGGGAATCCGGTTCTTTTGTTATTAGGTGGTAASAGA 
AACTTTTATGTCGCTG 

SEQIDN0137 

TTANGNGCCAATGTTTCAATACACATTGCCCCCGCCATGAATATCGGAACAATGACACAATT 

ATTTGATGTAGCACAAGAAGAGTGTTCAGNCCTTTTCTTGTGGACTTATTTGGTTGCAGCAT 

TTGCACTTACTATTTGGTCCACTGTATTCATGTGGCTCTTGTCCTGATTGTCACAAGAAGAC 
ACAATATGAAGATATTATATAATGGTGTGGTGTGCT 

SEQIDN0138 

TGAACTGAATNTGGTATCTGTATTACTCCTGTTGTAATGGCATTGGACTTATACGGCCTTGG 
SEQIDN0139 

TGTTTGTCTTCCGGCTTTGAGGGCTTTCCAGATGGTACATATGAAGGAGTTGAAACTGGATA 
TGACGAATTATTTAGGACGCCATCATCCCTCACGGAGAGCCCAGACATTCTCTGG 

SEQIDNO140 

TTGAANNCCCNTTNNGANGCACCACAAGNTNNAATCCTTNCTGTAAATGGTAGCAAAATCCT 
ACCCGATTGGGGATACGGNAGAGTTTATACTGATTTAGTTATC7\ATTGCACTTTCCCTATTC 
CAGTTGGNACTGAAAATGGAGGAAAACTCGTANTTCATGCCGCTACTAACGGNGGCGGNGAC 
ACTAAATTNAACACCGCCGACACTTTTNTAGGG 

SEQIDN0141 

CANAATACCCCTTNAAACGACCCGAGTCTCAAAATCGANGAGATTGGAACACTTTCGGTCAG 
TACTTGAGGAATCAAAGGCCACCAGTTTCCATATCCCAATGTTATAGCAACCATGTCCTAGA 
GTTCCTCCGATACCTGGACCAGTTCGGGAAAACTAAGGTTCACTTACAAGGTTGTATCTTTT 
ATGGACAGCCCGAGCCGCCAGCTCCCTGTACTTGTCCG 



I in 
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SEQIDN0142 

CTNGTGGNANTAAATAACATTCTCATTTCTTTCCNNTNTTNTTCAGGTCCAGTACCACCAAT 
GGAGNCTTTCCCTATNAN 

TAATTCATATAGTCG 
SEQIDN0143 

GNGATGTCTNCNTATTGCACATCTGNTGTTGATTCACTTTATGGGAATGCTCAGAAATTTCA 
AGCAATCGAGACAGATAATCGCAGGCAACGAGCAGCTCTAGTGACCTTACAGGAAAAGGTAG 
ATGCTGTTGCTTACCCAAGAGGAACTCTGGGTGAAAAATACGTGCATACTTCCG 

SEQIDN014 4 

GAATGGAAAAAGTGAAAGTGAATTGCTTGACGGATGCCGAGTGCTACTATACATGGCTACTT 
GTAACCACCCTGCTAGTATTTGATGAACTGCTGAAGTCTATAAGGCCGTATGGCACTCTTAG 
TTTGTTGTCCGCGCTGATGTGTGAAGTCA 

SEQIDN0145 

GTTCCTTTGCCATTTCAGCTGCCGGCGGCAGTATCTTCACCTGGGACGACGTCTTTCAACTC 

CCCGAATCTCCTCAAAATGACTCTTCTACCCTCTCAGCTTTCTTCGATAAAATCAAGCTCTG 

TAATCGCAATTCGGAGAAGCAATCCGAGTTCATGCCTTTCGTGATTGAAGACCAAATAATNG 
GATACGTACACCACGTGGTTGCTGA 

SEQIDN014 6 

AGANTTTAACTCNATCCATTACTGTANATGGNATGCAAAATCCTACCCGATTGGGGATACGG 
NAGAGTTNATACTGNTTTATGCTATCAATTGCACTTTNCCTATTCCAGNNGGNACTGANAAT 
GGAGGAAAACTCGTANTTCATGCCGCTACTAACGGNNGCGGAGACACTAAATTCAACACCGC 
CGACACTTTCGTAGGGTTACTCAGGACTCATNAAGAAGG 

SEQIDN0147 

TTTNCACACTAATTCCCCTNTATCTACNAATGAAGTGCGTGGGCTTGCAGTACCTGGAGGCC 
ATTCGTAAGCTCAAGGCTTCTGGCTTCCAACCAACGCGCACTGTCTACCTTTCCTTCGTCCC 
CGACGAGGAAATCGGCGGNGNCGATGGAGCCGGAAAGTTTGTCGATTCCGATGTCTTCGTGA 
AGATGAATGTTGGGATTGTACTTGACGAGGGCTTGCCTTCTCCCACCGAAAACTATCGTGCA 
TTCTATGGGGAGAGGTCCCCCTGGTGGCTGGTCG 

SEQIDN0148 

GNTCCGTAAAGTCCCCAGNNTNCNCGACCCGTNACTCNGGAGTTACAGCGANACANGTGGCT 
GNATNATNNGACATACTCAGACCTANTTAGCTTTGATATAATCCGTGAGGGTAANTTCGTTC 
TTTGCAANCAAATGGACGAACCTGGTATGTTTAGCCTAATTGCAAACAGGTTTGCTGATGCT 
TTTATTTCATGGGTTTCAATTTGTACTAAAGCTCACTTGCCGTTCTTCATGTACTAGAAAAC 
TACATATGTCTATGACCCTTTACCTAGTCTGGTAATTTCAAGGCATGAGATTGNGATTGATC 
AAAACAAGTTGGAG 

SEQIDN0149 

GGTAACTCCGATATCATCGAGAGCCGATACGCACTCACAAAGCGGCAAGGTGCTCGCTATGT 
GCCTGCTGCTTTCTTGACTGGTTTGCTTGACCCGGTAAAGTCCAGGGANGAATTTGTCCAAC 
TATTTGCTGAGTTAGAGGGTAGGATACCAGTTCTAGTTCTGGCAACAGCAGGTTCTCCGAAG 
AGGTCAAAAGCAGAGATGGAAGCACTTATGGAGGCCAAAGGGGTGAGCAAGTATATCGAAGT 
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GCCAGGTGCTCTCCTTCCCCAGGAAGAGTATCCTGAAATAGTTGCAGAACAGCTTTACAGGN 
TTCTGCAAGAGAAGTTTGAGCTTNAGGC 

SEQIDNO150 

TTTTNNCACTTCTAAAACCCTCGTNTGANNCTGCNAGGCATGTGAAGNTGTCAAACTCAAAC 
CTTATGCCANAAAGTGCAAGAAACTGATCTTTGAATATGCGCCAGTGATTCTCGTAAATGCT 
GAACAGTTTCTGGAAAAAAATGACGTATGTGCTATTCTTCATGATTGAGAGCCTGCAGCAGA 
TAAAGAGCTACAAGCATCACCAAAGATGCAAGCTTCATTGCATTCGGCCTC 

SEQIDN0151 

CTGNCCCTATCCGATCCAATAGTTGACTCAAAGGTGTTGCCTATTCCAGCCGGAGATTTGAG 
TTTTGGTTCGGGTGCACAACTGAAAAANTCAGTTGGNAATTGGTCAAGATGTCTTACTGATT 
TGTTTGGCATAGATGCTGAAGATTCCGGNCAAAATGATGAAGGCAGCTTCGGAGATGATCAN 

AGGAAAGGTGGAAATCAACCAGAGCATTTCCATCTTCTCAATGCC 
SEQIDN0152 

AATCCTGTTCAATACCAAAATCAAGCAACAAACGGTTGGCAAGCTTCTTGGAAAAAGTTTTT 
G AT AG AT ATGAT G T AC 

SEQIDN0153 

TCAACTGAGAGGTGTGGGAAGAAATGAAGAATTGTTGATGGCTTATTTTGCAGAAAGCCTTA 
TGGGAGTAGCTCCGAATGGTTTATGGATCAAGACACGTCTCGCTGGTATGTCTGGGATGACA 
TGGCACAGGCCTTTGTCAAACGGTTCCAATACAACATCGACATTGCCCAGACCACATTTCCC 

TTTCAAACCTGAAGAAGAAACCAAGTGAAAGTTTCAGGGAATATGCCA 
SEQIDN0154 

CTGGGTGAAAAAGCTCTCCTTNTGCCTTNCCAGAAGCACCTAGCGCAACATGTAATGGATAG 
ATGTGCTCGGNCCATGGATGTGCAACTTTTGCATGCGGAGCCTTCATGTCATAGTTA 

SEQIDN0155 

CGCTGCTGTGNGTTGAATTTTCTCCCATTTTTTGGAGAGGTGTGTATCTGGA 
SEQIDN0156 

AATGGAGAATGGAAAAAGTGAAAGTGAATTGCTTGACGGATGCCGAGTGCTACTATACATGG 
CTACTGTAACCACCCTGC 

TAGTATTTGATGAACTGCTGAAGTCTATANGGCCGTATGGCACTCTTAGTTTGTTGTCCGNG 
CTGATGTGTGAAGTNA 

SEQIDN0157 

GNTATGTTGCTGATCAATCTNGTTATGGCATGGTTGATCCTTCTCAGCATTATTATCCGGAG 
CAACCATCCAAGCCGCAGCCAAGCATTTCGAACAGTCCTTATGCTGAGAATTATCAACAGCC 

AT TTGGTTCTTCATACAGTAGCGGCT 
SEQIDN0158 

TACCCAAAAATAAAAGTACCATCCTGATGCATCCTAATGTGCTACATATTGCAATCTTCATG 
GGTAAAAGAGGTCATTTGGCGGACCAATGAGGT 

SEQIDN0159 

ACGGGGCCTCNAGGCTAATAAACAAACAGAAAATGAAAATTCTTTTGAGAAAGAGTTGCTAA 
AAATGCAAGAAAAACTTCAAAAGATGACACTTGAGAAGGAGCAGACTGAGGAAATGTTGAAA 
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GCTAGAGAGGATATGTTGAAGCAGAAGGAGGAAGAGCTCGAAGCTCGGGATAAAGAGCANGA 
AAAGCTTCAAATTGAACTCAAAAAGTNGCAGAANATGAAAGAGT 

SEQIDNO160 

TTTTNNGANGTACTAANNCNCATTNTAGCCGACCGCACTCACAAAGCGGCAAGGCGCTCGCT 
ATGNGCCTGCTGCTTTCTTGACTGGTTTGCTTGACCCGGTA^GTCCAGGGAAGAATTTGTC 
CAACTATTNGCTGAGTTAGAGGGTAGGATACCAGTTCTAGTTCTGGCAACAGCAGGTTCTCC 
GAAGAGGTCAAAAGCAGANATGGAAGCACTTATGGAGGCCAAAGGGGTGAGCAAGTATATCG 
AAGTGCCAGGTGCTCTCCTTCCCCAGGAAGAGTATCCTGAAATAGTTGCAGAACAGCTTTAC 
AGGTTTCTGCAAGAGAAGTTNGAGCTTNAGGC 

SEQIDN0161 

TAATCNCACNAATNGAGGCCCTATGCAAATCTCNTTCAAGTGGAGTTTGCTTCATACTTGCA 
TTATTGTCACTNT 

SEQIDN0162 

TTATTGGGTAATTCCCATCTACTGGGTCTTTCTCAAGATTTNTCAACTGCATCTGTAGCCAG 
TGATAACAAGCAGGATG 

SEQIDN0163 

AAGTAAACTCNCGGTAGGGAAACTACNNCGATGAAGGTCTTCAGTCAGCTGAACTACTTGGA 
TGTGCTCATATCCGTTNGAATGAGCTTGAGCCTGGTAAAGTAAAGGNTATTTGG 

SEQIDN0164 

GNCGCGANTTCCTTCGTCC7VAGACTGANGCTTTNTANTTCAAACAGGTAGTTCAAATGCTTA 
CTGGGTCCTCTG71AACCGCCAAGGTTGCAGCTACTCCGGGTCGGGCTGAGCCNGTTAGACAT 
CNTATCCCGCCCA 

SEQIDN0165 

TGCTGTGANTCTTTTGCTACATATGCCTACGATTTACGAATNTGCAAATAATGTTGGATTTT 

CCAGGCATCTTCAGAATTGCCTATGGTGCAAATTGCTTTTCAGTATACTGTTGTTGNCCCAC 

CAGATGAACTTGCAAATGCAGGATCAAGTTCTACAACAAGAACAAAGCATTCCCTCAAAAGG 
AGA 

SEQIDN0166 

GCACNTGTCGAAAATCAGGATTGATGTCAATGCTGATCAGCACCCCTTTCAGTACAAAANCT 
AAATCAACCACAGAAGCCAGCTAAGGTGGACNTGAANTCCGCAGTTTATCCTGGCGGTCCAC 
CTTCACCGGCAAGGGCGCCAAAGATGTCGCACTTTGTCGATACAACAGAAATGGTAAGAGGA 
CCTGAGGAGTCACCTGGCTACTGGGTGGTAACTGGTGCAAAGCTATGTGTAGAANATNGTAG 
GATAAGAATGAAAGTGAAGTACTCGCTC 

SEQIDN0167 

CCTTTTNAGGCCACGNNTNGGAGCAGCAAACACAGCAAATNTAGCGATGAGANAGCCAGTAT 
TCATAAAAGTGGATCAGTTGA71ACCGGGAACAAGTGGTCACAATCTGACGG 

SEQIDN0168 

TTTTNCTTTGGATACATGGCTTGCATCTGCTATGGTTTCTTCCTCATGCTAGGGACGTGTTG 
GTTTCCGCTCATCCATGCTCTTTGTTCGTCACATTTATCGATCTATCAAGTGCGAGTAAACA 
TTTTGTGTAGTATTTGTTTCCTCCACTTTAGCCTCTCTACTTCTTCGGGGTAGATGAAAAGT 
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CTGCGTACACACTACCTCTCCAGACCCCATTAGTGGGATTTTACTGGATTGTTGTTGTTGTT 
TACACCACTTTGGTTATACC 



SEQIDN0169 

CTCCTACTCCTCAGTGTTTCTCAGCCAGCCGTGGAACTACAAAGGCCACTCCATCTAAGGCA 
AAGTATAGACCTCTGGAGACAAGGGGTATCCTTCAAGAACTGGAACAGAGCAGCAATGAAGA 
GAAGAGAAAGGAAGATCAAGGGAAGATGATGAGTAATAATCAACAAGGACAGAGAGGTGGTG 
CTATTGTTGCTGAAAAAGAAGCTGCTGCTAGAGCTTTGGATGTCTTCTGGTTCTTGAAACCT 
TGCACTCTTTCCAGCTGAAATGGTCAAAGCCCACTGCTGCAGAACATTTCATGAAGTGATTC 

TTTCATAC 
SEQIDNO170 

GTTCCCTCCTACCAAACTTGAGGAAATCAAGTCTATGCACAGCCCACAGTTAGCACAAAGGG 
CTTACAGCCAAGAGTCAATGTACTAAGAGGAGAAAGGAGCCCCAGATTGACGGGCAAGGGAC 
TTGAAATAAAGCAAACTCCTAGCCCCCAGCCATCTAATCTGGGTCAAAATGGTCGTGGTCCG 

TCTTCTACCTAGTA 
SEQIDN0171 

NTCNAGANTGATCGANCAGANGGTGCNGATGATACTTTGGNAAGGCCTAGTGAAGAGGNCAA 
CTCCAGATAATGAATCANAGTTTCAGGTGGAACAAGAGAGAGAGCANNTAGCNGCGGACGAA 
AGGGAAGAGGGAGAGCTAATTGCTGATCCTGAAGATGTTGGAAATNTCGAGGGAGTNAGCAA 

TTTA 

SEQIDN0172 

TAAGGNCANNCNAAGNCCANCAGTGCCATNACGCNNATTGCCTGACTGTTCANTGCCTACAN 
TNTGCNGTANTTCTAATGGCGANCTGAAAATGGCCAAGGNCCCCNAAACCTAGAGCTNTGTC 
AGTAGANTNGGGTNTATATTTGAATTNGATNCTGTTGAGTGATAANGATGGTGGACNCNTTG 
TACCTNTACCTGANTGCAAATAANGTNTTGTCATCAACAGANNTTATGCTA 

SEQIDN0173 

GTTAAGNGNGGCNANGAGGAGGCTGTTTCNATGCAGNNNGTCTGGNCTATCNNGTNNTNTNT 
AGGNNNANATCCTANGNCTCACCTGGNTCTCTTTAACCCTGAGNATCATTNCACCACTTTNA 
CTCAATNTNCTCAGNCCCANCTNTTCCTNTCAAAATTCGAAATTATTGTNCCCATAGTATAT 
ACTCTGTTTCTGGTCCTCCTTTTCCTCTGTGCTGTAGCCACAATTACATACAGCACGCTTCA 

TGTATCCTATGGTAGACCTATCAACCTCGTTTCCTCTA 
SEQIDN017 4 

GTCGNGCAAAAGAAGTTGTGGCTCACAAGTGGAAGCATCAGAGATACAGAATAGACAGTGNA 
GTTTGAACACTTNTTCCTGATTTATTTTCTCTCTGCCTTTAGGGA 

SEQIDN0175 

ATGAGAAGCCAAGGAGTCCCTAGTCTTNCGNGCTTGTTACCTGGGGCCCAAAGGAGCAACTA 
GTGACTCGNA 

SEQIDN017 6 

NNTCATCACCTATGCCACTACTCTTCTTCTGGAGAGGCGTGGGAAAGAGATTGTCTTGAAAG 
CANTGGGCACAGGCAATTAGCAAAACAGTTGCTATAGCAGANATCA 

SEQIDN0177 
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TTCGGCCAAGCTGACGCTTCCTCTTTCAAACAGGTAGATCAAATGCTTACTGGGNCCTCTGA 
AACCGCCAAGGTTGCAGNTANTCCGGGTCGNGCTGAGCCNGTTAGACATCCTATCCCGCCCA 



SEQIDN0178 

CGTNGNCCAACTCANGGCTGAACAAGTGATGAAGNGCCCTTCAC'GGGTTCCTCTGNAAGAAC 
CAGNGGCAGTTGGTGGTAAACATATGTCAAAGTCTNCAAGTATGAANGGAATCATCACCCCT 
GCGCCAAGGTTGAGNTTCTCCCCTTCCTTACCTATCACCCGAGCATCGGNTTCTNCCTCAAA 

GNCTTCTACGCAGCCCTCGTCTCGTCCTTCA 
SEQIDN0179 

CGNGTGGAAATCGCCCGTAACTGACCTGACATTTCCGGAGTTTACTGAGGAAGAGTCAACGT 
GGGACATGTGTTCGGATAA 

SEQIDNO180 

NTACTGGTACTNGAGCGGGGGAATTTTTCGATTATCTGCTGGGCTGATTCCATTCAGTAATG 
CTTTCCGGGAATGTATAATAAAATGCCGAGTTGGTGACTGAGAAGAAACCTTGTAAATAAAA 
TCACATAGTTCGTTGNANGAGTCGTGGTCATCAACTCCAATTCTGCATTTCACNCTCACTGG 
AACATTTGTATTTGCAGCAATTACNCACATGGCCTCAGCAACAAACTTTGGATCAAGCATGA 
GTCGCACACCAAAACACCCATGTCCAGCTACTTTAGGGCTAGGACATCCGCAA 

SEQIDN0181 

AGTGCCCATTTTNTCAGGTTGNANAATGAGCACCTTGNAATAACGTCGGGAAATATCGTGCC 
TTCAAGTCCGTACCATTGCCTT 

SEQIDN0182 

ATTCCCGGTTTAACCTCCAGTATCCTGTTTTNCTGATGAAGACATGCTAGAGGTCCCAACAT 
ATGCTTTAGAAGGCTAGAAACTTGTGAAGATAGCAGATGGTCATAATTGTAAAACTTGGGTG 
TCATGAAAATATACGTATCACGACAATGACTGGTGGCTGCAAAGTTGAATGTGTTGCTGATG 
GTGACTGTATAGTTAGTTGAATGTGTTGCTGATGGTGGCAAGAGAGGGTTTATACTTTTGGT 

TGTGTT 
SEQIDN0183 

CCNNNGNAAATCCCATACANATNTTGCNCTAAACTTNCTCACCGAAACAAACCTGGATGTTC 
TCTGNGAACTGGNTGATNTNC 

SEQIDN0184 

CNTTTGGAACCNCCCTCAAGCTGAAAAAGAGGANGGGATTATACCTCTTACCGACACATTGC 
CAATTATGCCACAGGNTTTATGTCACCATGCCC 

SEQIDN0185 

NAGCGCTTGGTCTTNCCTCACTGCTTNTGGTGNTNAATCTTGGNNTNTGTNCCNAGNCGTCN 
TCNNTAACATGAAGCNTANGGTNANAAAGGNAGAAAAGTNTNCTGCTNGAAGGGCTATCTTG 

CCCNTGCTTCANGCTGAAGAGGATGAAAGATTCGT 
SEQIDN018 6 

GCTTAGANNTCTTGAAGGTGACCTGATAATGGAATCTGGCCAATTGNCTACAACACCTAGGT 
ATGATGTGGGAAGCCAAAGTGGAAGGATTTTGTCTGATCACTACATNCAGCNTCATAGGTAC 

AGNGNCTCNATACTAAANGATGGGTTGGAGGGA 
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SEQIDN0187 

CTTGGATGGTCNACCAGATTGAAGAACNCGAGAAAAAGCTGTTTTCTCATCCACTTCATAAG 
T C AC AAAAT GAAC AN CAG C 

SEQIDN0188 

GANAGCCCATGCTGNTNTANNANAAGCTGCNGAGGCTGACTTGGNGGNNCTGTACNNAGGAT 
ACCAANNTGTTNTTGCTNATNCACGCTANAAGGGNGACTATTANGCCTAAGGATATTCAGNT 

GGCNAGGCGTATTACGGGAGAAAGGGC 
SEQIDN0189 

CGTTAGTACGCNCATCTGATAATGACNTTGAGAGTGCAAAATGCATGAGCTTGTTGTCAAAT 
AATCATCATCATTTCAGTACAAGGCCAAAGGCTGATTTTACAAATCAGGTCATCCGCAAAAT 

GAGGGACTTACCAGCTGCAAAGTTATCTCAGTTGC 
SEQIDNO190 

ACCTTGGAATTNCTTACTGAATGACTAAATGACTTGCTCGAAGGACGAGGTGGTGGCATTAG 
AGCACGGGCAGATATTTTGCCCCTTCTTGATTTTTCATCGCTGGACATGCTAGAAGCTGCTT 
TCACTGATGTACTAGAAGCTTGTTCTCTTGCACTTTTATCAGTCAACTCGTCATCAGCATCA 
TCACTTGAGCTTCCACTTTC7\AAG7\AGTCATCGCTGC 

SEQIDN0191 

ACACCNATTTCAACTACCNNGAGCTGACAGCNTACNCAATACACCGCAGANGCTTTTTGCTT 
ACTCCAAAGTTGCTCCCTAATCTGGAATACAGCGAAGCATGCAGCATTTTGACTGTAATGAA 
TGGTCCTTGGATTGAGCAGCCATCCAAAGCTTGGAGTGGTGGAGAATGCTGGAGGTCTGTGA 
TGGNACTAATGGNGGAACGTTGTCGCTGACTAGAAGCAGAAATACTTTAGGTTGATTCGAAC 
AGGATTTGTCCATAAGAAAAATTTGCTTCCTTGTTGATCTGCCATTGCGCTAGTTACAAGCT 
GAATCATGGTCGCTCANCTATGTTTTTTGAAAAATCTGTTATTACTGGCCCTTGTTCTAAAA 

TAACATAATTCTTTTGTACGC 
SEQIDN0192 

CGTTTGNNTCACGTTCTAATGTACGNTNACTTCATTGGAAACANTCCTACACATTCAGAGAC 
GAAACTGGCANACTCTTACCTCTATGACAAAGCTACATGNATTCTTGCTNGGAAACTCTTCC 
TCCCGGAAACAGATTTCAATCTGGACCATCTAGCTGCAAATCCTCTTGTACCAGAAAAAGAT 

ANACTCTTGGAC 
SEQIDN0193 

CCTTGG71ATTGCCTCCATTGNTGCGGGCCAGGCGTTCCTGATGTATGGAACGAGTTGCTATT 
TCAATTTTTGTGANAAGAACAGGTGCACTTTGTAAGTAATCTTTNCACTATCATTGGAGAAA 
AGAAAAGTTCACACCTTGAACGTAATGTACATCTCGAATGAGCACGCCCATGTTTCTACTGT 

TAGTA 
SEQIDN0194 

CCTTTCTAGTTTGCATCACNTGCATTTGACTTTGGGGACTCAACACAGGGGTTGGGTCCGTC 
TAGGACANGTGTACCCAAA 

SEQIDN0195 

CCTTNNACATTTTCTGGTTAGCCTCTGGTTTGTTTTTGATGTTTTTAGCACCGGTGTGCATA 
AT CCAGTGTGC 
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SEQIDN0196 

GTNNGNGAGCNTGGCNNAGGATGCAGACTACCAAAGCTCNNAAN'NAAGCTTCTNTGAACACT 
CTCNTAATAGGTNAGATGTAATGGTCTTTCAGAATGGCCAACACAGCAATANTGCCTTGAGT 
GTCCCAAATCACTGATGCCATGATGTGGGCTCCTAGACTGNCCTGACTNTNGCTTTGACTGT 
GCCANTGCCACCTCNTGGTAGCCNTNAGTTTTCATGATGCTTTGCCTTGGAGATCATATTGN 
CGAAGCCATGTTCCAACTGCCGCTTACAATTTGTCAGAGGGATNCGTCAGGATCGNGAACCC 

TCCTGT 
SEQIDN0197 

TGCGGANCCNGGGCATGNCAGCACGCCNNAGGACCATGCNAGCAGNATNACTGCTNCAGNAG 
ATNGNNATGANGNGNNTGNCNNNATGTTTGTTAGTCCNGCGTGTCTTTTAATANATCATCNN 
TNACCGCNTAGGTTNGNNCANCACTGNCGAGGCTNTTANGTNANNTAAGAGTCTGTCNTNGT 

CTAG 

SEQIDN0198 

AGTGCAATGAGTATAGCTATTGAGGCTATTGAATCTGGCGCTGTCAATGCTGCTTCTGTCCT 
TGAGCAAATTGAGCTCCAAATAGCTCAAGCTAAAGAGGAAGCTTTTAGCAGAAAAGACATTC 
TAGACAAAGTCGAGAAATGGATTGCTGCTTGTGAGGAGGAGTGTTGGCTTGAGGAGTATAAC 
AGGGATGAAAATCGCTATAATGCTGGACGAGGCACCCACCTTACCCTGAAGCGTGCTGAGAA 

AGCTC 

SEQIDN0199 

CAAAGTACGANAGCGATGGNTCTATCTCTCAGCTGCAGGCAATGCCAAGTCGCCTTGACTTC 
ACCACTGAATTCCTCTCTCTAGCTGCTCATGAAGCTATTGTCTGTCGTTGTCATCCAGTTAC 
TGTTGCTTCTCTGTCACTTCTCTTGAACTTCTATCCCACAGGGAAACAGATGCCAACAACTG 
AGGTTGTAGTTTTCAGGACTTTAGTCACGACTCTATCTCAGGGTCCTCAGAATGATTCTGAT 
ATCCTAAAGCAAATGAAACGAGCTCACACTCGGCTATCTGAGCTAGGTGCTGACAAATTTTT 
TGGGAAAGGTGAGATTGGGAGACGGGAAAGGAACTGGTTTTCAGTGAATGCATGGAATTCCG 

GTG 

SEQIDNO200 

CTGGGCNGACATNGCCTANTGNGGNNTTTCTGAGGNNTCCNATNGACATGATTGTGGGGACT 
CCAGGCAGGGTTCTACAACATATTGAAGAGGGAAACGTGGTTTATGGTGACATCAGATACTT 
GGTCTTGGATGAGGCTGATACCATGTTTGATCGCGGTTTTGGTCCTGATATACGAAAATTTC 
TTGCACCNCTGAAAAACCGTGCTTCGAAGCCTGGTGATGAAGGATTTCAAACNGTGTTGGTG 
ACAGCAACAATGACAAAGGCAGTTCAAAAGCTGGTTGACGAGGAGTTTCAAGGGATTCAGCA 
TTTACGTACTTCTACATTACATAAGAAGATTGCTTCTGCTCGTCATGATTTCATCAAACTTT 
CAGGTTCTGAGAACAAGCTGGAGGCGTTGCTACAGGTTCTTGAGCCAAGNTTAGCAAAGGGC 
AATAGAGTGATGGTATTCTGTAACACGTTGAATTCCAGTCGTGCTGTGGATCACTTTCTCAG 

TGAAAACCAATTTTCTACTG 
SEQIDNO201 

TTTNNCAACCTTTGGTATTGNGCTCACTTTTTNCTATGGCAGNCTTGGCTTCTTGGTGGCTG 
GCAAGAACGGAAAGAATGGTCAANNCTTCATCAACCATGCAATTGCTTGAATCTGTCAGCAT 

CG 
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SEQIDNO202 

ANGTTGGCCAAGGCACTTGCGAAATTCTTTGGTGCCAGGCTACTGATAGTTGATTCTCTCTT 
ATTACCTGGNGGATCAACTGCCAAAGACATTGACTCTGTAAAGGAAAGTTCTAAACCTGAGA 
GAGCAAGTACTTTCGCTAAACGTGCTGCTCAGGTGGCTGCACTACATC 

SEQIDNO203 

CACTTGCTACTCTAGCAACTGAAGGATTGGTTTCTGTTCATGGTGACGCTGTGAAGAGAATA 
TGATGAGTCTAAATTAGGAGTGAGGCATTCTCAAATTCATTGCTCAGGGAGCAGAAGTTGAT 
ATGTGGATTGCTACTATTTGCAAGAGCACTTTGCGGGCATGTTAGGCAAAGTCATGTTTTTT 
TGTTCCTGATCAGCATTCTTCACTATCTGCCCTTTGAAACAGTTAGCCATAC 

SEQIDNO204 

NGACCGCNGCGATNCTAGAATCAGTTGANANTTGNNGNTNGGACATGGNATNTCTNGCNCNT 

GAAGCNTTTTTGTCATCGACGATNATGAATTTCTACATCCGGTNCCTCCAGAAGACAAAAGC 

TCATGCAGANGTAGATGAGTATCACTTTTTCAATNCATATTTCTACAAGAAGCTCNAAGAGG 

CTGTACTGAGCAAGAAAGGAATNGCANATGCTTTGNTGGAATATCAACTNCCAACAGCTACC 
TGAAAGCAAAGTGCC 

SEQIDNO205 

TCAACCAAGNTGTTCCTATTGGTTCAAGCTTCTTCTTCAATCAACTTGCCTTTGCATTTTCT 
TCCAAAGAGGNATTTTAGGTACAGNAAAAAGATAGTGCCTTTGAAATTGCGGTTTAGTTGTA 
AAAGCAAAAATCTTGAAAGAGAAGCATGTGCTGATGATACTCAAACTGCTAAAGCAATCACA 
TCGCATAGTTCTAAACTCGAAGACGTTATCTGGTTTCAGTGTCGGCATATTATCAAGGGCTT 
GGCTTCCAGGATTTCTCCAACTGAAGAGTGAGGATGGCTTCCTTCTTCC 

SEQIDNO206 

NCAATGACAGTGCTTGCTCCAGCTTCCCTTACTTCCCCACCTTCTGTGGTGTCATTGAGCAC 
TTCACCAACATCACCAATGAGTCCCTTCATTGGTTCTTCTGATTTCACAGAGAGAGTGAGTA 
TCGATAAGCAAATAACTGCTGCTCAGAGCAATAGCTTGGTATCAG 

SEQIDNO207 

TTGCTTCCTTGATAGNGCNTGACAGANGCCTNANGACCNTCCANTCAGGATNACTCATTCNG 

GAGGTTGCNTGAGGAGATTTTNTTTATGTTTTTAGACTGCTGCNCTTTTTTATATCATCNTT 

TACCGACTAGCTTTGNACAGANNNGNCNAGGCTTTNAGGGGANGGTAGAGTTTGTCATAGTC 
TAG 

SEQIDNO208 

AGATGATAAAAGTCTGGACGAGGNTGGGGACCCGACGCTGCCCATCTTAGAGGACGGCGTAC 
CAACTGAGAATAAGAATATAACTAACTCACACCTTNCTCTGCAAATTCCAGACCTC 

SEQIDNO209 

CNNNNCACTCNNGAAATACTTTNNCGCCNGGCTACTGATANTTGATTCTCTCTTATTACCNG 
NTNTTTCAACTGCCAAAGACATTGACTCTGNAAAGGAAAGTTCTAAACCTGAGAGAGCAAGT 
ACTTTCGCTAAACGTGCTGCTCAGGTGGCTGCACTACATC 

SEQIDNO210 

ttttganggccttaagctacatcngaggaatgaagagtattatcggttgatttcaccacaca 
tataatgcgcactggacctacataatcctggaaactgtaatgtcaaggnggttgaagcggaa 
tcttcacaatcattcaaaggaagaatcacaatgctgcctatcagttttnatgatgc 
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SEQIDN0211 

CGCGTNNGTCTTAACGGCTGGTCGGCTGGCATANCGGTNATACGGNTTATTNTGCCAGTAAG 
TTTGGCCTCAAAGGACTGGCAGAAGCATTGCAGCAGGAGGTTATTGGCGAAAATATTCACGT 
ATCACTAATATTTCCCCCGGACACTGAAACTCCTGGATTTGCTGAAGAGAACAAAAGAAGGC 
CACGGGTGACTAGTATAATAGCAGCCTCTTCTGGTGCCATGAAAGCTGACGAAGTTGCCAAG 

ATAGCTTTGAATGGCA 
SEQIDN0212 

GGCANTTNTTTTCTATTACTTCCCAGCCTTGGGTGGAATGGAGTATGTCTTACATCACGGGC 
TCTCNATGTTTGCAATTGTTCAATCCCTT 

SEQIDN0213 

GNTGGAATGCCNAAAGAAGCANNCGACCCCTGTCCTATCAGTGTCTATTCTTCAGTTTGCTA 
GATGAGAAAAGATCATATTTTGTGCAAAGCAATCAACCAAACTTAGTTGCTGTTGCTGTAGT 

SEQIDN0214 

CTTCTTATCTTCTACAAATTGGCTCAATATTTTTAGTCTTACNATTTTATCTTTTTNTAATT 
TTNATAAAGANATATAATTNATTTGAGNGA 

SEQIDN0215 

TTCNGTTACTGGGGNTGATGATTTGACAAACCCGAAGTGGTATGTGGTGTGGTCTGCAAATA 
TGAACACTCACATTCTTCCCGAATGCGTAGTTAGCTACAAATATGGACGTCATATGTCAGGT 
CAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATGCTTCAAATGCAATGGGTACA 

SEQIDN0216 

GNAGCAATTATCCTTCCTTTTCTCTTTTCAATTATTTTTCGTAAGGGTAGTTGCTGATAGGT 
TTGGAGGTCCAATGGCGATTGGCATGAAGCAGATGTCCATAATAATAGCAACCCTTGGTGTT 
TTATCCTTTGTATTTGGAGTTATTGCTGAAAACAAGAAGCCTGCANCTGGGACTGCAATACC 
AGGAAAAGGCGTTGTTATTTGTAAATACAAGTCTGACCCTACTGTTGCCTTGGGCTATTTGT 
CTTTTGCTCTTCTTGTTGCATCTTCTGTGGCCGGTTTCCTGTCGTTATTTTATCCGTATCAA 
GGGAAGTCAATCCCACAAGCTGCTTTGCTCAAAAACACTANTTTTGNTGTGTTCCTCAACAT 

CGCATTGGGCACAACTGGTTTAGCAGCAGCA 
SEQIDN0217 

AATGAATATCTCCATGACTAGAAAATTGTAGACATGACACATTCTTTTCTTCTGCTTTGCAG 
GCTCAGTGAAGCTTTCATTTGGCTAGATTCGTGGATTTTGTTATCAGT 

SEQIDN0218 

CAAAGAAGAAGATGGGTCGTCATCATCATCATCATAATGAAGGAAATAGACCATATGATGAT 
CCATTCTTGGCATGTTGTTGTTGTCCTTGTTTTGTAGTTTCTTCTACTTTCTCTGTG 

SEQIDN0219 

GNAGGTTCATATCTCAAGGTCAAAGAAGAAGATGGGTCGTCATCATCATCATCATAATGAAG 
GAAATAGACCATATGATGATCCATTCTTGGCATGTTGTTGNTGTCCTTGTTTTGTAGTTTCT 

TCTACTTTCTCTGTG 
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SEQIDNO220 

GGAAATACCCCACGCTTCAGCAGATCAAGGAANCCTTCCTGAATCAGCTACTTCTGCTGCAC 
ATGAGGCATCTATTCGCGAATTTGCNGAGGCTGTCCGTGCTTATCGAGCTATTTTCCATGAT 
TCAGAACAGCAACTCTCTAGACTTGCACAAAATGTACCTAAAATGCATTTCGAAGCNGCCCA 

GCAGCACATCAAGAAACGACTTGCTTCTTCCAATCTTGTTGCCATG 
SEQIDN0221 

GAAAAGCAAAAATAAGNCGACATGGGGTTTATTTGGAAGAAGAAGATTGGGACTTTGAATAA 
TAAGAAGCGTAAGTGGGAGGAGAATAGATCACGGCGGCAGCCATAGCTCCAGGGAACACTAC 
ATATTTAGCTATCAGCGCCACCTTCCTCACCAATGGCTNTCCCTTCATCAACGCCATTTGC 

SEQIDN0222 

NCTNNAAAGGCCACTCAAGTTCCTGATATAAAANGACATAGTATTTGCACATCACCTTTGCA 
CATCCTTCCATTTTCTTCATATCAACTCTAGATCACTCATAATATCCGATGTAAATGTCATG 
CAAATAGTTATTGTACTATATTGTGTAAGGAATAAGGACAAGAAAAAAGTCTGTACATGTTC 
AGTACAGACGCAATTTTTTTTTCCAATATTTCCAATCCTTGGTTGCC 

SEQIDN0223 

TTTGCANACAGTCCCNTACTTTCCCGACTACTTNCAATAGGATNCTGAAGATGCCTTTGATT 
CATCTTCTGACATTCTAGCCTATCCGTGGNCAATCAACACAAAGTATTATAATGCTGATGGT 
TCTATTTGGATGGCCCATCTTNATGAGGACTTNTCCATTGGAGCTTTGCCAGCATTTGACCA 
CCTTATTGCATTGGTGTTGGTCTTCGATATTAGTGATCTCTCATCTTTTGCTGCGCTGAAAG 

ATTGGGTTTCTCGCACCGACATCC 
SEQIDN0224 

ACCCAAATCCTCCTCAACCTGAACAATGTTGAAGAGAGCCAACCTCTACCAGGTTGTAGTTT 
GGTTGCAAATGTlAAGAACTCCTATGAAGCTCCTGTCTGAAAGTGAAGTAATGCTTGAAACAC 
CTGCTCAGCCCACACCAAAGAGATCGGTGCCAATTACCGAAAATAAGTACAAGAGTATGACA 
TGCCAAAACTCTGTTGTTTCCAATCTAATTGTCAAAAGGTCATTGGATTTTTCCACCTTGGG 
TGGTGAAGAGATATCTTCAGATTTGAGTTCTGGCAGTATAGAGCATCATGAAGATGTAGATA 

ATGCCC 
SEQIDN0225 

TACCGGATGANTTTGTNGATGATGANCATTCCCAGTTTTTGNTTGCACCTGCTNTTTTGGCC 
AGNTTTCNATTGAGAGAGAGGAGACGCNAAAAGCATGCTGCTGNTTTAGNGAAACAAGATGA 
TGAGGNAACCGTNAAGCTTGAAAATGCTGCCCTTGAACGCTCTAAGTCAGTTGACTCTGCTG 
AGCTGGGGAAGTATAGCATATGGNGGAAAGAGAATGAANATGAGAATACTGATTCAAAGGNA 
CNCTTGANGCGGGACCAAATGGNTACNGNAAGGCTGTNTATAANCATNGCAACAATGAAGAA 

AAAGATNGGCTTGGCTCAAGAGTTAGAGAATCGGC ' 
SEQIDN0226 

TTTCTTGCNTACAGGATNTCAAGGCATTGTTCCCCTCTGATAGAAATCCCTTTTATGCTGGT 
TTTGGAANNNGAGACACCGACAAGCTCAGCTACCTCAAGGTTGGAATACCTGAAGGAAAAAT 
CTTCACCATCGATCCAAAGGGTCAAATTCTTATGAACCACCACATAGATACAAAATCATACA 
CCTACATACATGGTTGCGTCGATGACATGTTTCCACCCTTGTCCTCACGTGAGCAGATTTGT 
TATGGACAAAGTGTTACTATTACTAAGAGGAGGTATTTTCGCCGTTGATGGGTAAATGACTG 

Cir^MPP A /rnn + !niiDH\ 
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GAGAGTGATGCTACTGATGGATTCATCGTTGCGAATGACACACATACTGGAATTGTGATGAC 
CTGCATTTGAAGAAAATTCTGTCTCTGAACCTTTAGTAGGGGATATGGTCTCTGTATAACTG 
GTTTTGTCAAGAAATGTCCACC 

SEQIDN0227 

TATCAGNCGAATCTAATTTTGTACCCGGTGGATTGTTATGTGGT'CCTCAATGNTNAAGNAAT 
ATGNNCGNTNTTGT 

SEQIDN0228 

TTATGGCTTTCACTCCAAGAAACCCTCGTGCTGGAAAGCCACCTGATCATTACATAGAATAC 
ATGCGC 

SEQIDN0229 

CTCCTATACCTGANAGGACTCTCACTTTGGAGCCTCCTCTAGGTATNTAATTCCGNCCTTTG 
NNGGGAGACTGACATCGGNTACATGCAGCGNNGCTTCAACTGTAAATATTCCACCTTNATGG 

CCTTGCCNTCGT 
SEQIDNO230 

CACCCAGAAAATNATCTATAAAGTATTATGATCCAGGACGAGCTGACTAAACTAGCTGATGA 
GGAAGATGACGAGGAAGAGGAAGGCGATGCTGAGAAGGATGTAAAAAAGCCTTCTGGCAAAG 
GTGTGAAGGCCTGAAAANACATGGGNAGANTGTNANCACANNANAGGCCCNCNACNCTATCN 
ATCAATATCCAACCTTTCTCTTCCTCGTGAATTTGTGCCTTGTGAGTTCAACCAGCTGTAAT 
CTATTC 

SEQIDN0231 

GANNCCNTTNNCTTNCTAANTAANNGCAAAAATAAGCGACATGGGGTTTATNTGGAAGAAGA 
AGANTGGGACTTTGAATAATANGAAGCGTNAGTGGGAGGAGAATAGATCACGGCGGCNGCCA 
TAGCTNCAGGGAACACTACATATTTAGCTATCAGCGCCACCTTNCTNACCAATGGCTCTCCC 
TTCATCAACGCCATTTGC 

SEQIDN0232 

GCCTTGANTTTGCGCNCCNAACAGGATGATCTTGTAGATGATGATCATTCCCAGTTTTTGGA 
TGCACCTGCTTTTTTGGCCAGAAGGCAATTGAGAGAGAGGAGACGCGAAAAGCATGCTGCTG 
CTTTAGTGAAACAAGATGATNAGGTAACCGTNAAGCTTGAAAATGCTGCCCTNGAACGCTCT 
AACTANGCTGGCTTTTTTGTGCTGNACCAGAATCCCTTATGNNGNACANNNANTGATANTGA 
GAANTCNNANTCAAAGGTACGCTTGATGCGNGACCAAATGGTTACGGCAAGGCTGTATATAA 
GCATTGCAACAATGAAGAAAAAGATTGGCTTGGCTCAAGAGTTACAGAATCNGC 

SEQIDN0233 

TTGGACTTGAGTGCTTGTAGATGGTGCCTTTGCCGATACCCACGGCATCCGCAATCATCTCG 
ACGGTGACGCTGTCTTCACCCTGGTCGAGGAACAGCTTGAGTGCGGTGTCGAGAATTTCCTG 
CTCGCGGCGGCGAAACTCACGGACCTTGCGGGGTTCTTTGTGCATAAGAAAAAGGTCTGCAG 
AGGTGGATATTGGGAGGGTGGCGTGCCCCAAGCCGTGCTTGTCGGAACGTTTTACCCGGTGC 
GCGGGATTATCCCGACTGAACGGTCGTTGGGCAACGCCTATATGAACAACCTTTGCAC 

SEQIDN0234 

TCAACCTGAACAATGTTGAAGAGAGCCAACCTCTACCAGGTTGTAGTTTGGTTGCAAATGAA 
AGAACTCCTATGAAGCTCCTGTCTGAAAGTGAAGTAATGCTTGAAACACCTGCTCAGCCCAC 
ACCAAAGAGATCGGTGCCAATTACCGAAAATAAGTACAAGAGTATGACATGCCAAAACTCTG 
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TTGTTTCCAATCTAATTGTCAAAAGGTCATTGGATTTTTCCACCTTGGGTGGTGAAGAGATA 
TCTTCAGATTTGAGTTCTGGCAGTATAGAGCATCATGAAGATGTAGATAATGCCC 



SEQIDN0235 

TCATGTGCATTTTGACTTTGGAGACTCAACACAGGGGTTGGGTCTGTCTAGGACAGGTGCAC 
CTGAAATGAAAAGACCATCTTGATGCATCCTATGTGCTACATGTTGCATTTATTCAAGGGTA 
AAAAGGTCATTTGGCGGACCAATGATAGTTGAGGGCAAGTGAAAAAATGAAAAAAATGAAAA 
AAGGGAAAAAGAGAGGGTGAAGTGTGAGGATAAAGCGAGCGGGGCCTAATTAGGTTATCTGT 
TACATTTTTGT 

SEQIDN0236 

TCACGTGCATCTGACTTTGGGGACTCAACACAGGGGTTGGGTCCGTCTAGGACAGGTGTACC 
TAAAATAACAGACCATCTTGATGCATCCTATGTGCTACATGTTGCATTTCTTCAAGGGCAAA 
AGGGTCATTTGGCGGACCAATGATAGTTGAGGGAAAATGAAAAAGAAAAGAAAANGAGGTTG 
AAGTGTAAAGATAAAGCGAGTGGGGCCCGATTATATTTTNTGTCACATTCTTG 

SEQIDN0237 

AGCAGNCANGGATNAAATGGGAAAAACNTGTCAAGCCTANCTCTACCACCAA71AGAGANTGA 
AAGATCNGACTNGAGCACACCACTNGATACATAGGTATCAGGCACATAGAAGATTAGTTACT 
GTTTGCCAACCGAAGAAATTCTTTCACTACTGATGGCAAGCATACCA 

SEQIDN0238 

AGAGGTATCCTTCTAGTGTGGATTTTGATACTGGGGTTGATGATTTGACAAACCCGAAGTGG 
TATGTGGTGTGGTCTGCAAATATGAACACTCACATTCTTCCCGAATGCGTAGTTAGCTAC7^A 
ATATGGACGTCATATGTCAGGTCAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATG 
CTTCAAATGCAATGGGTACA 

SEQIDN0239 

GAGNGGCAAGTCCTGGCGCTCTATTTCCGAGAGNAGAAAAAAGAATTTTGTTTTTGCT 
SEQIDNO240 

GGGTNCAGGCNTCTGGCAANTCCTCGGTGTCANNGNTACACAACTGGAGATGGAAAGAAAGT 
TTATGTCGCCAAAAATGGGCAGGAGTTTTCTGGTCAAAGCGCATATAGATGTTACAGAAAGG 
AGACTGGAGCTGGTT 

SEQIDN0241 

GGTTCTATCTAAANNATTTAATGACTTGCAGCTGZ\AGGATGGTNTGTACACTAGCAAAGCTG 
AACTGCGGAAACGTATCAGG7\AACTCAAAAATGGGCCAGGAAGAAATCACACTGCAGGTGGG 
AGGGTTGAAAAGTCTAAAGGTTTTTCTCCCAATAGCTTTGATCGTGTGCTCCTTGATGCTCC 
TTGCTCTGCATTAGGT 

SEQIDNQ242 

TAMANTACGACCGAGAACATCCTNATCAAATGCTCTCAAGCTGTTCTTCATCAAAACCTTTT 
TTCGGGTTCATCTTCAGTACCATTACAACAATCTTTGATTGAAGCTGCGGGTAATGTTGTAC 
AACGGGCTTNGAACTCGGCCCAGCCCGAACATCAGCCCAATGTGGTAAAAAATCCGAAAAAA 
CGAACCAGGGCATCAAGGAGAGCGCCAACTACTGTCCTTACTACTGACACCACAAATTTTCG 
ACAAATGGTTCAAGAATTCACTGGCATCCCTACAGCTCCGTTTACTGGTTCAGCCTACACTC 



r 
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GCCGCCTTGATCTTTTTTCTACAGCTGGCTCAGCGATGAGGTCGGGTCATTTGGATACTCTT 
GGGCCACTTTACCCT 



SEQIDN0243 

AAGACAGGGATGGCAGTGCTGAGAGNAGGGCAAAGATTGAGCAATGGAATAGGGAAAAAGAA 
GAGGCAGAATCTGCTAAATACAATAATTTTGACACTGATAATGGCAAGAGTGATGGTGGTGA 
TCACTATGGAGAACAGTTTGATGACGATTACCCGAAGCAGCAGTAGGTAGCAAATGGAAGTT 
ATGGCTACTGATAGTAGTGTTACTCTGGGTGGAGTACAGGTCCACTGTGCTGTGATTTTGAA 

AAAAGCATAACCCTTCTATTGTCTTCTTTTTACCATGT 
SEQIDN024 4 

GGATCAGGAAGGGCATGTGGCTGATGCAGGAAAAGAAACATTGACATCTGTTCAAACATCTG 
AAATTGAAGATTGGACAAAATACAAGGATGATGATATTATGCAACAGCAATCTTCCATCCAG 
GCTGAACAAGCTGTAAAAACTCAATTTGTTGGCGATAAGGAACCTTTGTCTTCATTAGAAGC 
TGAATACCATCTGGGAAATTCAATTTTGCTGGAGAAAATAAAGGTGCTGAGTGAACAATATG 
CTGCCCTTAGAAGAACACGTGGAGATGGAAATTGCTTTTTCCGCAGTTTCATGTTTGGTTAC 

CTTGTATGC 
SEQIDN0245 

CCGNCAAACAAAGTAAAAGATGCAGGATCAGGAAGGGCATGTGGCTGATGCAGGAAAAGAAA 
CATTGACATCTGTTCAAACATCTGAAATTGAAGATTGGACAAAATACAAGGATGATGATATT 
ATGCAACAGCAATCTTCCATCCAGGCTGAACAAGCTGTAAAAACTCAATTTGTTGGCGATAA 
GGAACCTTTGTCTTCATTAGAAGCTGAATACCATCTGGGAAATTCAATTTTGCTGGAGAAAA 
TAAAGGTGCTGAGTGAACAATATGCTGCCCTTAGAAGAACACGTGGAGATGGAAATTGCTTT 
TTCCGCAGTTTCATGTTTGGTTACCTTGAGCACATTCTGGAATCACAAGAATCA7\AGCGAAG 

TTCATCGCA 
SEQIDN024 6 

GCAAACCTGAAAGAAANGANTGGCAATGATATTTNNTCTGATGGCNAAGGTGAANCCAGAGA 
TTACTTTGGTGGCGTGCGCAAACCACCAGGTGGAGAGAGCAGCATTGCACTAGTTTAGATGA 

TG 

SEQIDN0247 

GAATTTTAGATTTCAATGGCTCTCAATGAGTTACACGGAATCAAGNTCTAAAGTACCTTTGC 
GGATGCGAGTTTGCTAGAGGCTGGTCTCTAATGTTGG 

SEQIDN0248 n ^ 
CAGGGTGCTTTGTGACATATCCCTGCACTGATCACCCAGGTGACCTAACTCTGGTCTAAGCT 

CTGCCTAAAGGGGCATTGTGACAGATCTCTGCACTGATCACTCAGGTGATGTAACTATTGTC 
TAGGCTCTGC 

SEQIDN0249 

GAAGAGTGGACTCTTTATGAGCAGGTAGCTGTTGCAGCTATGGATTGTCAGTCTCTTGATGT 
GGCAAAGGACTGCATAAAGGTATTGCAAAAGAAGTTTCCAGGGAGCAAAAGGGTTGGTAGGC 
TAGAAGCTATGTTGCTAGAGGCCAGAGGATTGTGGTCAGAGGCAGAAAATGCTTACTCAAGC 
CTTTTGGAGGAAAATCCCTTTGATCAGGTTGTACATAAGAGGAGGGCAGCCATGGCAAAGGC 

GCAAGGCAATACGTCAGCAGCAATTGACTGGC 
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SEQIDNO250 

TTCTGCTGGAAAGTACTATGATGGTAGATTTGATGAACCCCAACAACAATATTTTTTGGATG 
CTTGTTTCCTTTGTAAGAAACGCCTTGCAG 



SEQIDN0251 

AAACTAATATACGAGTTGTGTCTGCATCTTCCTCAACTGGTTTCATAGATTGATCAATTGTG 
GCACCGGCAATGTGGTGGCGAGGTTTACATCCTCGAATTGTTCCACCGGTCAAAAGATGACC 
TGCTTTTCTGAAATGAATTCTGTTTCCCAAGCAG 

SEQIDN0252 

GCACATTTGAGAGCNCNNGCGCANTGNCATNTCTTNAGCAGNGGAAGAGTAANTTCTAGATG 
TA7\ATACCCTGCTTTCCCGTAAGAACTGGTTTATATTGAAAGCAGAAATGCCTCTGCTGGCC 
AATTTTCGACTTATAATTCCAGACATATCCACTTCCTC 

SEQIDN0253 

AGCAAGTGAAGGATTGGTTTCTGTTCATGGTGACGCTGTGAAGAGAATATGATGAGTCTAAA 
TCAGGAGTGAGGCATTCTCAANTTCATTGCTCAGGGAGCAGAAGTTGATATGTAGATTGCTA 
CTATTTGCAAGAGCACTTTCCTGGCATGTTAGTCANAGTCATGTTTTTTGTTCCTGATCAGC 
AGTCTTCTTTNACTATCTGCCCTTTGNAAGAGTTAGCCATACGTTAGAGCAATGTGTTCTTT 
TCAATGTTGGATATTTATTTGAACTTGATC 

SEQIDN0254 

AGTANGCGTAGGGAAGACAGGGATGGCANTGCTGAGAGGAGGGCAAAGATTGAGCNATGGAA 
TAGGGT^AAAAGAAGAGGCAGAATCTGCTAAATACAATAlATTTTGACACTGATAATGGCAAGA 
GTGATGGTGGTGATCACTATGGAGAACAGCTNGATGANGATTACCCNANGCAGCANTAGGTA 
NCAAGATGGAAGTTATGGCTNCTGATANTANCGTTACTNTGGNNGGAGTACANGNCCANTGN 
NCTGCAGATTTTGNANANAGCATANCCCTTCTATTGGATTCTTTTTACCANGT 

SEQIDN0255 

AAGCTAACAGAATCGTTTGTGGAATAAGGGTGTCGATTCCGAGAGCTTCCACCCCCGTTATC 
GGNCTCATGAAATGCGACTAAGACTAAGTAATGGAGAATCCGATAAACCTTTGATAGTCCAT 
GTTGGACGACTTGGAGTTGAG7\AGAGTTTGGATTTCCTCAAAAGGGTCATGGATAGACTTCC 
AGATGCTCGCATTGCTTTTATTGGAGATGGGCCATACAGGGAGGAATTGGAGAAAATGTTCC 
ATGGCATGCCTGCCGNGTTCACAGGTATGTTACTAGGAGAGGAGCTTTCCCAAGCATATNCC 
ANCGGNGATGTTNTTCTTATNCCTTNANAGTNAGAGACACTGGGGCTCGTCGCTTTGGAGGC 
CATGTCATCAGGGCTTCCTGTAGTANCTGCCCGTGCCGG 

SEQIDN0256 

CGNTTTNTNCTCGGNGNGTCAGCTNNGNGGANGCNCTGGGTGCTGGTTCNNAGGNCTNATGA 
AACGCTCNAAGGCAACAATCTGGTTATGACAACTGCGGGAAAAATTCCCTTTCTGCGCAAAC 
TCTCAAAGCGATGGACTNGCNATCANAGCAANTTTTGCATCTGC 

SEQIDN0257 

atgangangaggatgaagaagattataagccaccacctaggaagcaatctgataattctgat 
gaagatgcggagtcttttccgttgaaacgaaagctatctccgaaagaagagcctgagccaaa 
aaggttgcagcggattgctaaaggctcaaagtctcgagacggtgttttcgctgctttgtgct 
caacc 
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SEQIDN0258 

ATGATCAAGCCCTAGAATTCGCGAAGATGCTCGATCAATCGGGAACTGTAATTGTTTTGGGA 
AATATCGTATTCCTGAAGCCTGACCAGGTGGTGAAAGCCATGAAAGGCCTAATGCCAATGCC 
CTTGGCCGAACCAAATGACCCAAAAATGATGAAGGAGCTTCAACAAATGGAGGAGAAGAAAG 
CAGCAATTGACAAGAAGGCAGAATCATTGGTGCGGACAGAGTTGTGGCGTGGACTAGGTTAC 
TTTGTGATTCAGACTTCAGCTTTCATGAGGCTCACTTTCTGGGAGTTATCATGGGATGTAAT 
GGAGCCTATTTGCTTCTATGTCACATCCATTTACTGCATGGCTGGGTATGCTTTCTTCCTTA 
GGACCTCCAAAGAACCTTCTTTTGAAGGGTTTTTCCAGAGCCGGTTTAGTGCAAAGCAAAAG 
CGATTGATGAAGCTTCATAAATTTGGATCTTCATAGGGACCAAGAGCTCCACAGAGCTTGNG 
ATCCTCATTCGACGATACCTGGTGGAAACACC 

SEQIDN0259 

CAGATGGTACTGTAAACATGTATGTTCATCATGAGATTATTATTCCTGCGNTTCCTGTCTGC 
ACAGCATGGATCGATTGCCCTA 

SEQIDNO260 

TTGNTTACTCNGCCCTTGNATTTCAATGNGCTAATCCATTANCCCNCACGGAATGACGNTCT 
AAAGTACCTTTGCGGATGCGAGTTTGCTAGAGGCTGGTCTCTAATGATGG 

SEQIDN0261 

CAATAANTTTATTTGGAGGCTTTCCTTCCCTGCCTGGTTTGATGTCAATGACCTATCTGAAA 
ATGCTATTGATGATGATGAGGGTTTAGATGCTTCAGCAGCATATGTGGCGAGTTTGTTGGCT 
ACGGAGCCCCCTCACATCAAACTTGGGGTTGGAGGCTTCAGCATGGGCGCAGCGACATCTCT 
TTATTCTGCAACTTGTTTCACTCGTGGGAAGTATGAGAATGGCAACTCGTACTCTGCCAATC 
TGAGTGCAGCTGTTGGA 

SEQIDN0262 

GGATCAGGTTNTAGCAGATACACTATAATCANAGTTGNNGTGGTCATGGGGCATGGNTATAT 
TTGGNGGAAGGGGTGGAAGCTTNCCGAA 

SEQIDN0263 

GTGTCCCAGCAAGGATTACCCAGGTGATGTACCTCTCATCAAGGCTCTGCCTACAGGCACAT 
TGTGATGTATCTCTGCACTGATCACCTAGGTCATGTAACTTTTNTCTAGGCTCTACCTACGA 
TGGCATTGTGACATATCTCTGCACTAATCATCCAAGTGATGTAACTCTTGTCTAGGATGTGC 
CTAAA 

SEQIDN0264 

CCGNTACTCTCCGCTNGACCAGNTCGTTTNCTTCCCCTTTTTCAGGCTGGTGACACACTANT 
ACAGTCAGTANGACAACTTCATCACTGATTTTGAGACAAAGATCAATCTTNTCAAGCTTGCN 
CATTTTGCGGTCATTNNTTCTCNGGAANACCCNGANAAAGAGGCTGNTATAGGTTACCTTGA 
AGGAGAGACTGAGAAACTTCNNNATACTAAGGAGACACNGATAAAGGAGCCGATTCTTTATA 
T 



SEQIDN0265 
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GGCTGTTAGTGGCTCAAAAATTGTTGGCTCAGCCAAAGCAGAATCCATTGAAAGTGGTGAAA 
GGACTCGTCACATGCAGCCTACACTTNCGAATAGTCCACACCCTTCTCTTTCTTGCAATGCT 
GTTGTATATTCTGCATATGAAGCATCCAAGGACGAAGTAACCCAAAATAATGCACCAGCTAC 
TGATGATTGTGGATTCTTCGAGTCAGGCTATATGCTTGCGAACGGGACAGGGCCTCCTATTG 

GAGAAAGCAACTATGACGAAGCTGTTGAATTTGATCCAA 

ACA^TGGTTACAGATGTTATGGAAAATCTTGTCAAGAGGGCTATAATGGCTGAATCTGAAA 
CTGCTTTAGAGAAGGAGAAGGTAACAATAGGTCNTGAAGAGATTCAAAGAAAGGCGCTTCAG 
ATTGAAAACATGTCAGGTAAGTTAGAAGAGATGGAAAGGTTTGCTTTGGGTACAAATTGTAT 
CTTGAATGAGATGCGCCAGAGAGTTGAAGATTTGGTCGAAGAAACTTCTAGACAGAGGCAGC 

GAGCTNCAGAAAATGAGCAGGAGCTTTCTCGTG 

GNNTNTGGANGCTGNACATNTCATCCTCANCNCAGGCCTANNCTTAGNNCNAGGNGCCNNCC 
ATNNTNCAGNTNNCTCTTNCCGNNATTCTANTNATTCGTGCACATGNNGAAACCTATGCTNT 
TGCGNCNGCTNNANGNACANTCANNNCTGCANNGNCNGANCCTTCNTGCNCANCNTAATCAA 
CCTTNCAACNGCATGATGACTCTTCATGCATAGCCATATGNTATCTTCATTACGGGCTTTTT 

CAGACATACCGCTTCGTTAGCAGGCATCTTACCC 

SEQIDN0268 ^, m ^ TlTl _ 
GATATTCGTAGGGCGAGGACTGTTATCTTACAAAGGATCATCAAACCCCCAAACCACTAAAG 

TGCTGAAATTTGCCTTAGCAGCAGNGAACATTTATCTGCTTTTCATAGTTTGTGATG 

GGGTCAATACTCTGTCTTCACTGCGATCGATATTTCGCGAATGTTGCGGTGAGGGACGAGCA 
TTTCAAGACGAAAAAGCACAGGAAGCGTGTGAAAATAATGATGGGCCCTGCACCACACACCC 
AACTTGATGCTGATTTAGCTGCTGGAATTGGCATGCCAGATAATGGTCCAAAGCTAATGTCG 
ATGAGTTGAGCTTCTTTCGTCCTGTTTATAACTCCTACATTACTGGTAGAGTTCTTTTGAAC 
TTTGAGAATTTGTCTGAGGAACATAGGTTTTTGTTAGTCTACCATCTCTCTCTCAGTATAGC 

AAGT 

S EQI D1SI O 2 7 0 

TGATGACCTTTNNGNATCTNGTAATATNTGAGAACAATCCAAACGTTGAGAGCTGCAGCAAT 
TGATCAAGTTACCCTCTTNGAAGAACAGAAGATATTAGCTACAGAACAAGCACAGATGGTGA 
AGAAGCTTGGTGATTCAGAAACGAAGACTGCAATGCTCAAGTCACAGGCTGAAAGTTTAGCA 
AATTACTGTGATGATGTGGCCAGCACTAATAAAACACGAGCGCTGCAGAAGGGAGTCTGCAA 
GTATAGTTCCTATTTTTTGATACAGNTGGTATTGCTGGTTATCGTCTTTGGACTGTATGTTT 
TGCAGATGTCACCTGATGCTGTTGAAGTTGTACCGACATAATTTTGAGAAGTGAGCCTTTTT 

CCTTTTTCTTGTATTTTCAACATAAAGCAACGATGAACG 

AGGGTTATTCGGGTCGGACCTGGCGAATGCAATTGCTAAAGATACAACAATTTTTGATCGAG 
GTTTAGATACNCATTTGAGACCTACCATTGATTGTCTTAGGAAAACTTTGGGCACCGATGAA 

AATGTAG 

CACTCAAANTCCNGNCAGAATCCGGNGAANTTTTCGGCGAGACATTCCAGTAGAGTTCTTGT 
CCGAGGTTTTGACATTTCAGATTCATCGAGGTCTATTTCTTCCTTCCTCACGTTGTTTGTGC 
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SEQIDN0273 

ATTGGCCGGTCTTGGACTTCAAAAAACNNTCNCAGAGCTTCGAAGTATCACTTCTAAACCTC 
AATCGGAGAAGAAAAAATATAACANAGTTGACTATTTCTCTACTCCTTTGCGCCGTTCCGAT 
CGATTGAAAGGCAACACCCCTCCCGAATCAGAATTGCGCCGTTCGGGTCGCTTGAATGAGAA 
GTCCTGCTACTCTGCTCCACCAGCAAAAAGGAAATTGGGGCTTTTTGAAGAAGGAGATGTTG 
AAGAAGATAATGAGAAGAGACCTGCTAATGCACCTCTCCTGAGAGTGAAAGATGGC 

m 

SEQIDN0274 

GCCNGCTGTGNNCTGCAGTTGTTGTAAAGGTTGAAGTAGCTCTAGACAAAAGCATTTGCATG 
TTGACCAGATGAGCAGAACTGATGTTATTTGCAGTAGAAGGAGGAGGTTTCTTCTCGTCTTC 
AGCTTCTGGATATAGTAAGGGCCTGACCCTTCTACTCTTGGGTCAGAAGAACGAAGAGAAGC 
CCATGAGAGTTGCACCGTGGAANCAGTACCAGTTGGTGGACCAAGAAACTGATCCGGACCTC 
CAGCTGGCTTCCGGGAAGAACAGGGTTGTCCGCGGGTGCGCCTCCTTTGTATGCTTTGGTCG 

CGCTGCCGCTGGACTTGAGAGCCCATCTCCCC 
SEQIDN0275 

CGTTCTNCTGGATNGTTCCTGGCTATATTATGGGAGGGGAAAACAGGAACAAAGAGAAAGCA 
AGATTGCGAAAAGGTATATCTATTCTTGTTGCAACTCCTGGACGTCTTTTGGATCACCTAAA 
AAACACATCATCATTCTTGTACACGAACCTGCNCTGGATAATTTTTGATGAAGCAGACAGAA 

T T C T G G AAC T T G GAT AT G G T AAAG AG AT T G AAG AN AT AC 
SEQIDN027 6 

TCCAGGATGATGGCACTCCTGTCTCAATATTTGCACTTACGGGGAGTAATGCAAACGATGGA 
CATTTAGCTGCTGGCCGAAATGGAGTCAAGCGACTTCGCACTGTTAGGCATCCAAATATTTT 
GTCATTTCTTCACAGCACCGAAGCAGAAAATTTTGATGGTTCTACTACCAAGGTTACCATCT 
ATATTGTTACTGAACCTGTCATGCCACTCTCGGAGAAGCTAAAGGAATTAGGA 

SEQIDN0277 

ATGNGCAAATTTGCGATCCNAGCGTCAGATGAATCCATTACCCAGGAGATTGCTTCANATTT 
TCAGGGNTGGNTGNATGATCTAACTGATGGTGGTGTTGAGTACATGCCTGAAGANNAAGTAA 
AGGNGGCTGCTGCTGAAAAGCTAAAGATTTCAATGGAACGGATAGCATTACTAAAGGCGGCA 
AGACCTCCCCGAAGTCTCCAAAATCTGATGATGAAGAAGAAGAGGAGGAAGACGAGGATGAT 

GAGAACCAAAAGAAAGAAGACATGA 
SEQIDN0278 

TTGTCTAAGATAAAAAATGTAATAGTAAAGAGAGCTGCAGATGAAGACATGGAAACTGCTTC 
TATGTTGCTTAGGTGTTGCTATAATTTTTATAAGGACACTTTTTGTGCATTGCTCCCATCAG 
GTNTAT^ACCTTTATATGGTGCCATCTCAATTTGCTACAGAAACATATATCCAACCTGGGATA 
GATGCAGTTGACATACTCGATATGMCACTTCACGGAAGCTACTTTTGTGGGCCTACACACT 

TCTGCATGGCCATTGCACAAATGTCTCAGCTGCTA 
SEQIDN0279 

GCTTTCTTGCCTGCCGTAGACACAGTGNGAAGGGNGAGTGCCTACATGAATGNTTTAGAGTG 
AACCCTGATGGTGTCAAAGACAAAATTAGCTGTGGTGAGNTTCTGGATNTGACTCTNGAGGA 

TGNCGATAAATGCATAGAGCTTATTTNTACGCCGATCCGCAAAGATGCA 
SEQIDNO280 

GCGATACGAGGCGAAAAAACTAAGCTTCCGGAGAGTGTGAAAGCAGATNCCCTTACTAATGA 
AGCTTTTCTTGACCGGGGGTTTACTCGCCCCAAGGTTCTGATCATTCTCCCTCTAGCAAGTG 
TTGCATTTCGAGTAGTCAAGCGGCTGATTGATTTGACACCTCCTAAATAC7^AGTCTAATGTA 
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GAGGAGCGTGAACGTTTCTATAGAGAATTCGGGGCCGGAGTAAGCAAAGATAGGGAGGATGA 
AGATGCCGTCGAAAGCTCTGAATCAAAGAAGAGCTCAAAACCATCTGATTTTCAAGCATTAT 
TTGGGGGAAATAACAATGATCACTTCATGCTAGGAA 

SEQIDN0281 

GCGGCATGTGAAAATCAACTGNTTGTGATATCCCACCTACTGGAC 
SEQIDN0282 

GNGTACGGGGNCCGGGCATAGATATGCCTGNANGGAGTNNGACAAAGCTTGCAGAGTGGNTC 
ATCCTTGTCAGACCACCCCTGCATGTATATNTTCTNTTGNTTNCCTNTCCCAGTACAAAGAT 
GGACCTTACTCCAGACAGCGTATGGTGGTAACGGATAGCTAATTNAGTGCANAGGTGTTGNC 

CTCCTCTTACTTATACCTTTCAGCAGTCCCCCATTATCGTGG 
SEQIDN0283 

GCTNACTNACATAATAATNANNCCNGAAAANTT^AAACTTCTTTTISIAATTATAATCATAAGCT 
CTACTCGGAGATGTGAACAGCGAGTTTTAGGTGGACTTNTGAAAGAATGCCTCGATTCGTNG 
TGNTCCAGAAGGAAGCGGCTTCTCTGTTGATAATCGAGGACGATTTTGAACCTTAGGAGAAG 
GATCANACGGCTGTGAAGGCACGGGAAGCGAGTCGAGAAGGAAATCGTTCGTAGGTTGATGC 

CTTTTCACAGCAACTC 

Group 4 

SEQIDN0284 

GGCCATCGGAGCAAAAGAGAGCAACTTACATTCTTGAACTACGTGAAGAATCTTCAGAACCC 
T 

SEQIDN0285 

GAAGGTGAAAATGGATATNGCGATATCTCAAAGGCACNTNAGGTATGGCACTTTGTT 
SEQIDN0286 

CTGNTCGAATGGGATATGCATATTCATATGTCCTATTGTACTAATCAGAGTTTCAAGATTCT 
GGCTT 

SEQIDN0287 

TGGATAGGTNAGCNANGAGCANACGANANNCCTGACNGGGAAAGGGATGCANTCAGACTCTC 
ACTGGCTTCAGCAATTCTT 

SEQIDN028 

TTG7^AATANCNNNTGNNAANNCTNACATTAGCCNCTCTGTTGTGAGGAAAGGCCTATTCCCC 
CTCTCTATGTACTTCATTTCTGNCATACAT 

SEQIDN0289 

TGATNNATGCTCTNTAATTGCCATACTCATTGGTAATTGTGTTGATGNGCCTTNATAACGGG 
TTATNATGGCCTNCTCTCTTCTATTAGCGCCAAATGTAGGAAAGTCATTAGTTTGTGTTTAG 
TTCAGGAACAGACATATTTCAGCCGTGCCACCGGACATCGCATGATGTCAAACTCTGNGAAC 

TAATCTCACTAGAGACGAGAAGACNATGGCCCGCTAGT 
SEQIDNO290 

GCAGCAGAAGANATGAACCGAAATGAAGGCCTGAGTTCGGCCCCAAACAGCCGATTCAACAA 
CAGAAATCAATGCACAGATTCAATCTCGAGCAGAATGT 
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SEQIDN0291 „ m „ 
GGCATGANAGGAACATTCACNCGTATGAGCACGCATGTTGCAGANTCTCCTTCGNGGGGCTG 

NTCCAAANATTCACCACTATGTTAGCCCAGGAAATTCNCCTCCCCNTGATNCTTCTGCTCTG 
CAGT 

SEQIDN0292 ' 

NTCTGTGCCGGCTCNANTNNGGATACTACAGCCGAAACCCTANCGAGCGTATNNNNNAAGTG 
CGCAAGAGATTGACAGATTGTAANGCTGTTACNGAGAATGCTGNGTAGGGAAGTCCATAANG 

ACCGCGTGATTACTATGT 

NGGAGTAGTAATACCCGTGTGGATAGTACCAAACTCAATTACTTTAGGAGGGTATGTTGCTC 
AACCTACCAACTGGC 

SEQIDN0294 

CTTTNGGNAGTCCGAACNCCCTCNNNGANAGACCAAANNGATGCGNNNNGCTCNTGCAAAGG 
GTGAGGANCNNNATNNTNGCC 

SEQIDN0295 

TTGCAGAATTGATGTGGTTGCTTTGCTCTAAAAGTTGGAACT 
SEQIDN029 

TTTGAAGNCCTTTNANCNNCNCTNANAGGGGCTGNNGNTGGACGCANCACACGATTCACATT 
CTNCNCCTTAGNCGAACGTGGTGTTCGGAACAGTTTACATCACT 

SEQIDN0297 mi4 __ 
ANCCCANGGTTANATGGNGAATCACACGATNACANANCTTCTCCTNAGCCGACGCCTGTACG 

GAACAGCATACTCACT 
SEQIDN0298 

GNTTAGNNANCCNNCGGTNNGNGATNGGATGNNGNTNAGGGNCTGNTTCAATCCTGTATAGN 
GACTCTTTNTTACCCGTTGTGTTCCNCT 

SEQIDN0299 m „„„„„ 
CCTAGANAGCGNGCTCCNGAAGAGAATAAGGCAATNGCCAAAGTTGCAAAAGTTCATGCCCC 

TNCGTTAGCAGNTTGGATCAATTGGCACAGGAGGGCCTCAGCTNTGCCTCGAAGATCTAAAG 
CTTTAC 

SEQIDNO300 „_ 
GGCCCTGACGTCTCCTCTATATTTTATTTCCTATTTCATCTTTTTTGCTTCAGAAACAATGT 

NTCCTTTTATTCTCGGACCTTGTATTTAGCAGTCTTAGAACGTCGGTGACATTGTGACACTA 
GGTTTTGGGTGATTATGGC 

NTTNTAACATCACGCATGCATAACAAACTGTCAATTGGTGTGAATATTCAGAAGTCTCTTAT 
TCATATCAATNCTCAGGGGGAATATNACNACTCTCCAGGAAAAAGACGTTTCANANACGGAC 
AGCTGCNAAGAGATGCAGTATGACAAGAAATTCATTCCTCTTCCTCCGCCTCCTCCAGCCAT 

TTCACAAAGGGCTCCAGCGACTTGACAAAGTTTTGCCTGCCC 
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SEQIDNO302ATAAACTCTCAAGTCCTGGCGGCCAGAAAACCAGCTTTCAGGCGGGGCGGNG 
GGGCTCCCCCTCCCCTTGCTTCGTCTCTGC 

SEQIDNO303 

GTAGGAGTCGNGGATGAGGANAGAAGNGTCCTGAGNAATNGAGGGAGANGGTGGANGAT 

■ 

SEQIDNO304 

NCCCANTGNTTTGACNCNGTGGTGNGAGGGGTNTTAANATGATTNAGTGCTATTNGCTAGAG 
TGGNTATAAGNCTTGGA 

SEQIDNO305 

CNCGATNGTAAACGCCCCGCANCGGNTATGGNTAAA7VAGNAGACCCTCAACAAAATNANGGA 
AT T G AN A C N TAN C N AAA 

SEQIDNO306 

ACNANTATNNGAAGGTAGAGNGTNTGATGGGNGAAAAACGAATNGGGACNGGGGGTGCNTAA 
ACNNNAGTCAGNTNGAAGAANATAGA 

SEQIDNO307 

GNTNNATNAGCACTCTGTTGTGAGGTAAGGGNCTGGTGCCCCTCGGGATGTANTTCANTATN 
GCCGGAGAT 

SEQIDNO308 

NTTTNGGGTGACAAGTCTTATGTCTCAGGAATAGCGCCATTCATNGGTCGCAAAAAGCTTGA 
AGAACAAACTGNTCTGATTTTGTTCAACTTTTTCTTCTAT 

SEQIDNO309 

CACCTNTCAACAGCATCCAGCNACTCTAANCGCNAGJ\AAAACANCCGNGCCTNCATTGAAAC 
CNCCATTTTGCTTTTGNTGNTCGAAGCNCTNNTCNNCAGATCNCGATNCTG7\AAN 

SEQIDNO310 

CCAAAGTNTCCGGCTCCANAGGGTTAGCAAGNGGGANGATGGCGTNGGGNNAGCGAGAATGA 
AAGCCTTCATNATCCCANGNAGAGAACA 

SEQIDN0311 

CCCATTTTCANCNACCNAANGCAGCCTAGGTTANAACCTCTNNNNNCTGNACAAGCANCAGG 
CTTTAAAGNTGNATGANTGAGGTCGANNGCGANCNTCTCAGNTNTNCCAGTATCCTCGCGCC 
TGAACCTA 

SEQIDN0312 

ANCCTGCNTGTTGTAACCGCCTGGGNTACTAATTGTATNANCTCTGCTATAAATTTTTTTAT 
SEQIDN0313 

NNNNCTNNNNNTGGGGANTAGAACCATTTTGTTCANTTCACTTTAGNNTTTGTNATGNAATG 
AAATAATAGCTATATCCNTNNNNTGAANNAAATGATGGCTGNTGCTGNGGGG 

SEQIDN0314 

CTGTTTTGGGNGNCAAGGATNNNGNCTGAGGNNNAGCGCCNNTCNTTGTTNCGCNANNAGNT 
TGCAGAACAAACTGNTGCTGATTATGCANAACNTTGCCTNCTG 
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SEQIDN0315 

CCNGGANGNAGACCCNCTGNTGGCATCAGGNTATACTAGCNTCAACTAGGGAGTGGAGACCC 
TATNTTGACA 

SEQIDN0316 

NCNTNAGATGNNTAAAATGGTGNGNTGCTTNGGCTCTAANGAAG'NNGGGGNACT 
SEQIDN0317 

NTCGTNNNNNNNCTGTGTACTGNNATATGTGTCTGNATTACTCCTGNTGTAATGCATTGACT 
TATACGGGNCTTGGG 

SEQIDN0318 

TNGNTANGCCCCTATTNGTTACAGGATCNCTAGTTTCCCACANAANATCGNCCATNGC 
SEQIDN0319 

TTNTAANACCCCATNNTGCATCTCACATAATGGACCGGCCANCAATANGTGAATTAGCTGGA 
TGATATTCAAACGAAAATTCATCATCTCC 

SEQIDNO320 

TTGNAAGCCCTAGTTNTANCCCAGCAGGGGCTGCTCCTGAAGGGCAATTTTACCCACCTTAT 
TATCCACCCTATGGGTATACGCCACCACCTCTACCATATCAACANTATTATCCTCAACCTTA 
TCAAGCTACAACCCCACTCCACCTGGTGGTCAGCAAGCCACAGATCAGCAGCAGCGGCACAA 

CAACC 

SEQIDN0321 

TTTGTAGCCCTAGTTGNTCCCAGGNGGGGCTGCTCGTGAAGGGNAANTNTACCCACCTTATT 
ATCCACCCTATGGGNNTNCGCCAGCACCTNTACCATATCAACAGTATNATCCTCAACCTTAT 
CAAGCTACAANCCCACTCCACCTGGTGGNCAGNAAGCCACACATCAGCAGCAGCGGCACAAC 

AACC 

SEQIDN0322 

TGCCCAGGGAATGGGTATTGGGNGCAGTTGTACCGGGAACACTANATGACTATCAAAAATGN 
GCTTCACNGGACA 

SEQIDN0323 

GCTNNGGAATNGNANTGGAGCANNTGNACNNGGACACTACATGACTATCAAAAATGGGCGTC 
NTCACGACA 

SEQIDN0324 

TGAAACTATGTGCAAGAATTAGTCAGTTGACAATAATTTGATTGAGTCTTTCAATTCTTAGC 
ATTTTGGAAGCTAGATACAAGCCTATGA 

SEQIDN0325 

ATCNTGATNNTCGGCCATCTGGTACNTGGAANNGGCGCTGGTGAGACTTGANTCTNGNCAGA 
GGNGGACCCCNAGCCACGAGCAGGATGCTGCATTANCATTGCNATCAGCAGTATAGGAATTC 
TCTTGCTCTGGCCAGATCGAATTTGAGGGCCATGGCATCAAGAGCCA 

SEQIDN0326 

GTTAATACCCGGATGTGGAACAGGAACTTCAGTCTGTNNGATAAGAATTACCTCTCCAGCAT 
CCAGGCTCAGCAGACTCTGCATCAGATTTCTTCACAATTCAATGGTGCT 
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SEQIDN0327 

TTTTTNTAGCTNCTAANAGCCCAAATTTCTCCGAGNCCAAAACAAGGTCAAGGTCCAAACAG 

TGAATTGGCCTTGGAGCAGGGCGTGAAAGACTCTGATATAGATGCTGCAAAAGTTGCTGCAT 
TGAAGGCTGCTGAACTAG 

SEQIDN0328 

TNNGCNGATNNTAAANTCCCCTCTTCGACGACNACNGCTNAGCATGCNTNTGTCTGANGAGT 

NCTAAAGGCTGTTNCCAAATTTACTAGNTCTTGACATGCGTATCTAACTGGANTGATTGGTA 
G ANT AT AAAAN T GN G AC AANN N G TN T G AC TNG 

SEQIDN0329 

TTTAAAACCCTTNTAAAAACCGAAAAATGCTTNTAAAAGGGTCCAAGGCAGAGACCAAGAAA 
ANTAACTGTTGAAGANCGGANAGATGGAAGNAANGTANAATTTTGTNNAAGGATATGGTNAN 
GATTGTTTTTNAAGAGANGNCGNAAAACNAACCCCAAAATTCCTCCAG 

SEQIDNO330 

TTTTAGCGGCTNCTAAAGCNCGGACTAAGAGACCNTCNGCAAATGGCNAGGNTTGCNAGGTA 
ANNGCNTGNCNNNCGCNANTCNNAGTGCNCCCTTCNATNTTAGTACTNTTNCGNATTNTTAG 
ACTATNNANGGNGANAGTAGTACNGACCGGAANANGAGGCTCGAGACTTGTGACACCAGANC 
ANANTGNGCTACNCCCCCGCTAGGTATTGTACNCTTCCNNATGAACNTNNCGNTGC 

SEQIDN0331 

TNCNNNNNNNCTNCCGAGCNGNTNTCTCTGACTTAGGTNTATATTCTAGGAACTCTTCAGTG 

GGAAATGCCGTTNAAATTATGATACTAACTGTTAAGGTAGGAAAGATTACTGGTTGACACAG 
CATA 

SEQIDN0332 

GNNNNTGTNNNNGGNGTGNNCGCATNGGGTGAGTGGAGTTCACNAGNNTGGGNNACTGAAAT 

TTATAGAGACGCTANTGAGGGGGCGGAGNGGCCNNNNTCNNATTCNGACNTTCTNGTGCCNN 
ATNACNAT TAGAAGNA 

SEQIDN0333 

CCCNNNGNNNNCGNGGAATTGCGANTTGNAAAGCAACNTGTTGTCATGNAGAGCAGGAAACA 

AAATNTCGTATCTCGATCTAGANCNTNAGCACANTACAGANNTATGNNACAGGCTGTGNGNG 
AGGTANTCANNTATCGGTTTGTA 

SEQIDN0334 

GNNNNNCNNNNGCNGNTCTGTGGTCTTGNCNTTGGANATTAAGCNCCTACTTNNTACGNTAC 
TGNANNAGNCNGCNTCTANGAGCAAGCNACNAGCCCTACTACTANATTNANCTACTGCCTTT 
ATGTNTAACAAAGNNNGAGCAAGANAGGACCAACAGATGCTACTAGCTAGAGTTGATCATA 

SEQIDN0335 

TAAANGNNNNGNAGCAAGGAAGCTCTAGCTTGAAGGATGCTGATTATNANTTTTGATTAGAA 

TTTTACAAATGTAAAGAATTATACTAATGTAAAGAACTACGTTTGGGCTTGATCCCCATAGG 

AGCTTAGCCCGGGGTACGTAGGCAACCTGTGAGAAAAGGAGAGATCAGGTGCAGCCCCTTGT 
A 

SEQIDNQ336 
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ACNCGAATNGNAAAGGAACCCGAAACTATGANTNNNAAACTNGNAATTCTTTGATGCTACAA 
ATTGGCACTGNATNG 

SEQIDN0337 

GNGTCCAATTNNGGTTTACGTGTTACTNTNGTTTTCCCTGCTCATACTAAGCTGTGAAGATG 
ATTTAGTGCTATTTGAGTAGCAGTGGTTGTAAGCCTTGGGA 

SEQIDN0338 

GTTGTGTNCCATTCNGCATGCTNTATTACATGNGTTGTATGAGGTGNNACTGATCAGGAACA 
CTANATGACTATCAAAAATGTGCTTTACGNCA 

SEQIDN0339 

CATTGTCTTCTTTTTNTTTCTTCTTTTGGCGAATTTTCTTTTGNTTTCTTGA 
SEQIDNO34 0 

CTGGGTGTAAGTNGAAGAAGGATAATGGACAAGTGATCCAAAGCATTATAGGGACGACACTT 
TAGGCA 

SEQIDN0341 

NGGCCGGAGTGGGTGNGGNGANGANTGGATCGTTGGTGAGTGGTGNGTNNNC 
SEQIDN0342 

GCCGCATACATGCATATCCGNGGGGCAGCAGGATGCGGGAACAGTTTTTTNATGGGNACCCC 
TANTGCANGNNCN 

SEQIDN0343 

AATCCNNGTNTAAGATTNTCAGCNTTGGGCNAGAGNAAGCNCTAATCNTGATNANCANTGGT 
GAACCNAANTANCCAGTTACCACCT 

SEQIDN034 4 

GCTCTTCTGTAAANGGTTATTTTTTGACTGACANNCAAGGGGGTAAATTTTTANTTANNACC 
ANAANTTGNTTAAGGNNN 

SEQIDN034 5 

ACGTACACATTCTCCTCAATTGCTCAGGAAATGGTATTGGGTGCAGTTGTACNGGNAACACT 
ACATGACTATCAAAAATGTGCTTCACGACA 

SEQIDN034 6 

GGTGCGATCGNCTGCCGAAGAAGCGTTGTACTTGNAAAATATCGGAGGAAATATCCCTGAAA 
TAACTGCCAACGCTGGTGCAGNCAAAAGGTACTATGTTCGNTCTTNNATNTAGCA 

SEQIDN0347 

NCNGTTATAGTCGANACACANGGNATGCCCTCTNGNAAACATNTATTGTACNGGATGACGTA 
TTCTGATANTNNCTTCAAANAAAGANNCATCACTAGNGAGCACGAAAGATAAGTGTNTTNTC 

T C AAAGAAAT G AC C A 
SEQIDN034 8 

TNGCCGNTTNCCATGNNGNACNTGGATANTCNAANNCTNTCCGNNNGNGCTCGNGNNTANNG 
NCCGGCNANACACCANNCCNACTNTNTGTGACGCNTGNAGGACNANCTATGNTGGNAGGANT 
TNATAGNNNGNNCCANATCNGCNCTNGACAGNCACTNNCCTGNGACTNCNNTGNANC 



FIGURE 4 (continued) 



99/140 



SEQIDN034 9 

TAGNCCGCTNGTTCAAGAGATTNNGCTCTGGCATCTGTAAGTGAGATATCAAAGCGCACTTC 

TGAAACCCCTCAACGAGAAAATAGAAGGAATNCAACAAAGATTGACCAACCATTCTGTAGAA 

GCAGAACAGAAAAGGGTGAATTGCTATCACACTCAGGAAATTTTGAATCAATAAACGAGAAT 
GGAAACAGAACATGTTCCCGTACTGNATTTTNTCCTTTCAGC 

SEQIDNO350 

CCCAAANTCCNTCTTNTACGATTACTCAGGAACNNATNATGNGATTGNNCTNGACCGAANGC 
CTTNTNCGTGATTACCTGGAAAAGCTGCAGCTGGACA 

SEQIDN0351 

TACTATGTTATTGTTCGTCANGANANTNTGCAACNGNTGNCCCA 
SEQIDN0352 

GCGAGGGCCTCCCAAGNTGAGTNTGNAGCNNGGNGTNANGNAATNAAGAGNAGAAAGAGGNT 
CANGCGGNNGAAAATGTAA 

SEQIDN0353 

GCGCGGGACCCTACCGAANGGGTAATTTGNAGCAGNCTCGTACAAAANATAGGAGGAGTANA 
ANGTAAGNTCNNGCGGAAGANNATGTAA 

SEQIDN0354 

TGTGNTTCTTCCTGTTATGGGGACTTGTTGGTTATTTCCTTTTTTGTGAAGCTCTGGTCGTT 
AC C T C AAAG T G T AT GT AC T T C CAAACGG AA 

SEQIDN0355 

ATTGTCTTCTCTTTTGGTTTCTTTTCTTTTGGCGAATTTTCTTTTGTTTTCTGCTTGA 
SEQIDN0356 

AAGCACTCTGTTGTGAGGTAAGGCCTAGTCCCCCTCTCTATGTACTTNATTTNTGCCATACA 
TTT 

SEQIDN0357 

GGATTCAANCCATCGAGGGTCCATNGTGGTCTCCGGCTTACGGNCTATTNGTGNTCAACTAT 
TNGGTGGNCCGCATNNTTCTTGTANACTANCGGGAANATCT 

SEQIDN0358 

TGCTCAGNTNGATCNAAGGGGNGTNTTTTNACATGGAACAGGGCAACTGCCTCTACTTGNTT 
TNATGCCTTTTTCATTNNGTNCATTTCTAGGGATCGGCCGT 

SEQIDN0359 

GATNNNNTANCCNNGGNCTATNAACGTTNCCGANGCAGGTNCGCNATGCTNTGNCCTTATNN 
CATNGCGAANGAGTACCNGGANANCCCNCNTGGACANACNTGAGGGCAGCCATGGGNAGGCT 
GANACAAAATTCTGGTTCACTAATTTCCATCTTTNCTTTTTNTTTATNNGCCAACACANTAA 
CTNTATTGGTACTAGAACATGGNATTACCTTTGGGT 

SEQIDNO360 

NACGCAGNNNAAANACGATGACGAAAGNCCGCCAAAACCACTGACTTGACACNTNNAAGATT 
GCTNGGGANCANAGGANGCN 
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SEQIDN03 61 

TGTNNAAANAAGGCGTGCCGAGGCNGACGGATGTGNCANGTGTCNCANGACGATGTTACTGA 
ATNGGTANTTACANCGGGAATCTGTGGCGNTCATGC 

SEQIDN03 62 

AAGNCGGAANGTTTGTANCCCGNACCNCANANAAATTCACATTG 
SEQIDN03 63 

GGAGCACAGCAATTCNAAATTCTTTCTACCATTTTGGTTTCATATCTAAGTCATTCCCTATT 
GGGCTTGCGCT 

SEQIDN03 64 

GTGGGATGCTGACNNTGNAGCTNTTNGTNTNGTNCCNNAGNNATTNCNNGCNATTAAGCAT- 
SEQIDN03 65 

CGCTANNNTAGCANTCCGATGTGAGGGANGNNNCNAGNCCCCCTCTTTATGAACTTGANTGC 
TGGCATACA 

SEQIDN03 66 

GNAAAGCTAANGTGANNATTAGCACTCTGNTGGGAGGGTANNNNCTANANTCCCCTAANTAT 
GNTACTTAATTGGGGCCGTNCAT 

SEQIDN03 67 

GCCGGCTNNTGNAGNGNCGNTGCTTNNTTNAGTNTNNTGAGCATGGNCCTNNAGAAAACGCT 
NGTGGCATGATGCNTNANGGGGN 

SEQIDN03 68 

NNNTATCCCTGCTGTGAGGAGTGTTNTTCCTTGTGTNATGCCTNTATTTGNGTTTCCGCNNT 
TGTGCTCTTNTCNTAATGTATAGATTNTNACTGTAGATTCTCAT 

SEQIDN03 69 

GNCNGGTNGNNGAACTAAAGTAAGTNGGTAGGCATGGTGGCGAATGAACCTAAAAAGTAAAA 
T C T AAC T T G C AG G AT C AAAC AT AN GN T C A 

SEQIDNO370 

CNCATTGTANATCAACCTATATGATGGACTTACGNGAAGTTTCCAAGACACATGACTAAAGC 
TGACCAAGTCTANTAGGCTAGNTCAAGCCCGTACCGTGACA 

SEQIDN0371 

TCGTATTTATGCNCATGAATGATGTGCAGTGNTGTGTCCTGACTNATNGGAGCCGTTGTCAA 
ACATGNNGTATGAGTAGGAAGNATTNNCTGCTCNTCTCGGNCATGNAGGNAGCCANATNNGT 
CNGNNAGT GCAGAT 

SEQIDN0372 

NCGATNCNNANGACNCANNNNNGCGAGGTGNGTAANANTTTGNNACCTTTANTNGCTGCACT 
ANGANATCGACNNGCNCNGTGANNGNNNNACNTGAGGAAANCANAGCNGGAATGNCTNAGTA 

SEQIDN0373 
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AATATGGAACTGGAATTATGTATCTGTATTACTCCTGTTGTAATGCATTGACTTATACGGCC 
TTGG 

GCTTATAGTGCTGNATTTATGCTGATAAATTCTGTAACATAATAGTGAGGTTGTAATGTAGA 
TGTTGAAGAGCTACCTG 



SEQIDN0375 

GNTNAAGCAGNGTNGNTAANAGGNNGCATTTTCTAGTTTCAGATTTTTCTGTTCTTGGAGCA 
ATAACATCCATCTTTCTCCT 

SEQIDN0376 ^^^m 
CCGTTNCCCTCAAACACCCTTGAATCCTATCGAATCTGGATTTGAAGACGAACCCTAGAAAT 

TCCAAAATCCTAAATCGAGTGTTCGTTGAATTTTTCCAGTCTAAATTGATTTTATTCGTGTG 
TTCTTG 

SEQIDN0377 _ m „,, m 
TTTGNGATANNTTTAGTTGGATGGNATGGAATGCTTATCTNNTATNCGAAANGATGGT 

TTGAATAACNCCAGNATNGGCNNAATACANNCCCTAATANCGAATGATCTGGTATTTTACAG 
GNCTGACGGGGGGNCGCCCTTTTCCGTGN 

SEQIDN0379 

TGCTTGTANNANGCCNATGCTGTNTGGTGGNNCGCGCACGTNGTGNTCNNNTGAGAGGACAT 
NTCTGANTTGNGCCAGGNNCCNGANGAAGACTNCCGATANTTANTGCCGAGGCNCATGGGGG 

SEQIDNO380 

TNTAGACCCGTTTTATACAAAGCCCAAGGACTGAGACTNTGTACAGTTGCGGAATCTGCTTG 
ACCCCTTTTACATGGTTGATACTTGTAACCAAACAGAACATGCTGAAGGTGCAAAAGGTGGA 



SEQIDN0381 

NGCACGGCCCTCGGNCTTGCAAAAANGTGGNNACACCCTCGGGGNCNNNGCCAGNGGG 

SEQIDN0382 „,„„ m „ 
ACTCNANNCCCGCGTGCTCGCGCCAGCTCCCAATGCAAATGGNATAGAAAAATNCAATGCTG 

AGCATCG 
SEQIDN0383 

NNCTNNAATGTAGCTAGTACAAGTGGNAGTGNGCTACACAATATAGCTTGACCCCGACAAAA 
ATNCTNCACGCACTAGNAACTCATGACATGGTATACG 

SEQIDN0384 

CGNAGCNCGNNCGNACACNNCGACAAAGGGANCGNCACANCC 
SEQIDN0385 
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GCCCCCTGTNGCTGCTCCCTNAGTGNTNGGNCATNCAGTGGTAAGCATATTGGCCTGCGCCA 
GCATACTCTAANCATGGTNTGNGATAGAATTCCATCACGCTACTCTNGNGGCNCATGAAGAG 

CATATCCG 
SEQIDN0386 

NTGTAGCTTTCTNTGTAAGCTTATGTACCTANNNGNNCCTGCACCGCCCATGGCTGCCGGAT 
CTGATAGCTCCCAAACNATTNGTTTCAACCACAACCCAATTCTTGCCCCAAAACCAACCACA 
TCGTAGCCCACCAGNTNTGTTCTTCTCTCCG 

SEQIDN0387 

NATNATCTCCGTGAGAAAAGACNCTAATGANTATNGNTTAANCTTATGCCCTATACTCATTC 
GACGACTNACACTGNAATAAAGCCGAGTAATNGCAAATGCATTTATTTATACTACACC 

SEQIDN0388 

TTGANTACNNTNNANTNCNGNCCTTCCNTNCAAACAACAGNACNNTGAGAAGCCATAAAAAT 
ACAGCTAG 

SEQIDN0389 

GTGTNTCCTTGTGTNATGCCTNTNTTNTTGTTTCCGCTATTGTACTCTCATCATAATGNNTA 
CCATTTTTCTGNAGATTCTNA 

SEQIDNO390 

CCTCACGTGGTCTGGGACAGGGNACCNCGCTGGGCTGGGGCATNTNANGGCTCATATCGTGG 
CAGAGGACATGGCACTACACGAGGTGGTCGCGGTCGTGGAAGTAGCAGTTTGGGGCCGTGTC 

AG 

SEQIDN0391 

GGCCTGGTCNGTGTACTTANACAAAGTCCCAAGGACTGAGACTNTGTACAGTTGCGGAATCT 
GCTTGACCCCTTTTACATGGTTGATACTTGTAACCAAACAGAACATGCTGAAGGTGCAAAAG 

GTGGAG 
SEQIDN0392 

CTTAGCANCACAGCTGCTTANCACAAAGATACCAGCCCAGGGAAGTTGAATTTGNNTGTCTA 
CAGCNAAAGCCATTGCNGANGNAAAGCCCCTNGTTN 

4 

SEQIDN0393 

TACAAAACGTNTTCATTCTTTCNANTAAATCTTNTATTNTTATNAGAGACATGGGTNGCCCG 
TTNGANGGAGTACTGNTGTTCTTCCTCNNGNTNAGTTGCNGAATATTGCANTNGCTGC 

SEQIDN0394 

GCTCTACAGAGGACAAGNACTNATATCTGNAGACAAGAGGGAATTGCAGCACTCANGATGTG 
GTAGAACGGACAAGGGAGTTTCCTCTNNTGNTCAAGTGATNTCTCTCTTC 

SEQIDN0395 

CCTTNGNTAGGCCGNCGACCTTCAGGANAACCTCNNTNCNGGAGACCGTNNCTNTCGNCNTG 
NTGATGGCCATNNNTTNAAACGNNTTGTGATG 

SEQIDN0396 

TGTAGTGAGGAGANTGAGGCTGCAGATGAGGTGGCTGGTAAAATCTGTGATGAATTTGATTC 
AACGGTAGTGAATAGTCATGTCAAAAGACTACCACTTGCTGATGTAACTGATTCATATCTGA 
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ATCTTCCTGCTTCAATCTCTGCAGCTGAGAGGTCTCATGCTAGGGGAAGTCTGGATTCTGTC 
AAGACAGATGCTAGCTGCACTGGGCATCATAATAAAGCCAAAAGAAAGCTTGGAAGTAGC 

SEQIDN0397 

GCAATCTNAACTCCCGACTTNGNTGNGTNCTGATNTCTGCTGTTGAATCGGCTGTTTGGTGG 
CTGAACTCAGACCATCATTTTGGTCCATTTCTTTGATGTTGTTCTGCTTGTAGTTGTCCTGA 
AGTATTTATGGAAGTTGATTCAAGTCTAATAGTGGCCTTTACTCTGCATTTTAGCTGTCCTG 
AAGAATTTATGGAAGTTGATTCAAGTCAAATAGTGGCCTTTACTCTGCATTTTAGGTACGTA 
CAGGTCAACTGTAATTCTCTGTTGCATTTCTAAATGAAAATATGGGTTATCTTGTCATGTTT 

NGNG 

SEQIDN0398 

TTTTANGCAAGNNTNNCCTCCCANGAACAANCCCTTAGTCCAGNTTCAAAG 
SEQIDN0399 

TTTTANGAGNAACTAAATCCCCTTNTNCCGANCCCNTGCAAAANGNGGNCTANACNGNNNNN 
NTGAGNGNNNAATNCNAANATNAAACNCTGCNTTCATTCTTTNCCTACTGATATGAGACTGT 

CAATNCTGNCAGGGCAC 
SEQIDNO400 

TTNTTNGGCTCGTCAGGGNGATTCTTCCTGCNTATGCTGATNATGAGTTGACCGATGTTCAN 
TGTTNNNTAGANCTGNCCNAGTCCNGGCAATGTNNCAAGTATATAGTGGCACTGCNCGGTNT 
TATGNCAACATCAATNCTGCGAAAAGCTTCACC 

SEQIDNO401 

NNNCACTNCTAAAGCNCTCTCCTAANGACCCCCAAGAGGANGCNTNTACTAGACATNCNACT 
CAGGCGNGATCCGCANNCCTGANCCGCGTATAGCTGGTATGATNGGNCANCCAAGGATTNTG 
GNNTACGAGGGCCGTTANGTGNGANANGCACAATGNNGGACAANANNTGNACCTNANGNGNN 
ACAACNCAACCCAAAGGCTAACTATGCGAACCAGACACACCTACTAACGCTCTACTATGTGN 

CACAAGCTGTGCGGTACGACAAGGC 
SEQIDNO4 02 

NGGCGTGGTGGCTGNAANGGGTCTNANGNTGCC 
SEQIDNO4 03 

TTCTCAGGNAGGCGGGGGTGNCATCNCTGAACACCANAGGCAGNTNNCC 
SEQIDNO404 

AACCTCTTTTCTAGNAACCACTCTCTNAATNTGTGGTNGGCGNTTNCA 
SEQIDNO405 

ACCNCNANNCNTCNGAGGGANANGCCNACNTNNTGGCNGTGGGCCCGGAANTGTNCNAATAT 
AA 

SEQIDNO406 

TGCCGGGGTTNTCNNACAAGAATGCCNNNCNCTGNNNCGTGTNTGTCTGNNCNCATATGCNG 
GANANGNNCNTGNCCNAAANNNGNCATNGTGCCTTNCAGTAGNATNANCNGATCANCTNTNA 
GAGTNNCCNNNNCAGGNNNNCNNCAGNTNGNTAGTGTNTNTGCTNTNGATNTGACCTTACTA 
TAAANATGAANCGGCACNACCATAAGGTATAAATGTAGGCACANTGCTTGCTCTATA 
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SEQIDNO4 07 

CTCGGTGAATGCACCATCCTCANTTCAAAGTGGTTGCTATGGTNTANCAGACANCATATCGG 
TNACANNNTNCGAATTGAACGAAGAATTTGGNGGTAAACTNTGTCAGCAGAGCATGAATGCT 
GGTTTGTCTAGTGGAGTTGAGGTTATTGATGTTTNTACTCCTCCATGCTACAAGGTAAGTGG 
AGACAGCAAGAAAAGAAGACTTTCTACGGCTTGTCTTGAAATTATTGATTTGACAGACTCAC 
CTATTTTTGTCTGATGTAACTAATA 

SEQIDNO408 

TGANAGAATGGGTTCTANTNAGGAACNATGTNTTGTA 
SEQIDNO4 09 

ACTCNCAGTTGNGNGGTGCGNAGTAAACAACTAACAAGANTGCGNAAGCATTCANGAGGACC 
CACTGTANGCTTATNNCATCTNGATCAAAATCAGAATGAAGTTATTTCTACTCTTG 

SEQIDNO410 

TGGNAGCGCCGCGTAGCGANAGGNACTATAGCCTGGGGTNGTATAGACACNTATNGGCTGGC 
ACANCTTCTNACA 

SEQIDN0411 

TNNCATTGAATNGCCCTACATNTACCAATNTGNAATCNACTGATACTTCTCAAAACATATCA 
NTGNCTTGCCCACTTCATTACGGGNTTGTATGANAANCCA 

SEQIDN0412 

TTCCCGGCCTGGTTNCCCTACTNATACTCNACCATACCCNAGAAACCCNTAACCTAATTCTT 
CATTNNCTCTCCNCATATCATCNTCAAATACTCTNTNCACANATTCGTTCCTTCTACAACTC 
CATCACTTTNTCCCTCTCGCCACCGTTCCAAGTATTGCACATGGGTGANAGCTGNTTNATGN 
TCTGTNGCTGNGACAGATGAACAACACCATATCGCNAGTAATGGACTAGTACACAAAGAATA 

TGCTGNCC 
SEQIDN0413 

CTNANTGGCGNNATCAGTGCTCACA 
SEQIDN0414 

GTTCTTTTNGCAACTTTGATCGGGAAAGGGCTCNCA 
SEQIDN0415 

NTACTTCTGTTTTCTTTTTGTGTCAAATATTGTTTGAACTCTGGGTTTTCTACCACGTGCCA 
CGGTACCACTGA 

SEQIDN0416 

TGCAATAATGAACAAAGCAAGATATCAGTAGTGATATCTTTGTTTTAGAGCATCTTTGTTTA 
GCTGCTNTCCACTANCTACAAAATTGAATATTGCAACATTTGTAACCTTATTTTTATCTTGG 

CAA 

SEQIDN0417 

TCCCTTGTTTNATGGAGCCGATTACTTTATGAGAATGCTCAGAAACTTCAAGCAATCGAGAC 
AGATAATCGCAGGCAACGAGCAGCTCTGGTGACCTTACAGGNNAAGGTAGATGCTGTTGCTT 

ACCCAAGAGGAACTCTGGGTGAAAAATACGTGCATACTTCCA 



SEQIDN0418 



FIGURE 4 (continued) 



105/140 



ACCCCNNTNTAAAGGGGCCAAAGGNANAANCTGCAATCATTATTCGATTGAAACAATCCTGC 
GATNNANACNNGANANNCTGANANATGNCTNAANNNAAANATTTGTGCTGANNGGGGTGCTN 
TTCNNCATGAGGANTANATNNTNNCANCNNCTNAAGCTTCTTTCCATACTGGA 

SEQIDN0419 

TTGCGTGGCAGTTNGGGGCANAGGCACTGGAGACAAGGGCNACTCCAA 
SEQIDNO420 

AAGCAACCTTGAATCAGACTCCTCACTGATCTCTCCTTCTCGTCACTGTTTCTGTGTGTGTG 
TGTGTGTGTGT 

SEQIDN0421 

GAATATGGAAGATTCCGAAAAAGTGTCAATAGATGGC7^AAAATCACAATGGGCATGCAAAAT 
ATAGTTTCAAGAACACAAATCGGAGGAAGATGTTTGGTCACCCTGAAAAATTTAGTTCAGTG 
GAAACTGCGATGTCTAGAATAAAGAATAAGAGTCATAGACCAGCTGATAGTGATGGAGAGGG 
TGGAATGT 

SEQIDN0422 

TNAAANANAATGNATTCCNCTNGGGT 
SEQIDN0423 

GAATTCTTTGGTGTNCATGCGAATTACGCGTTCAGTTCTTATTGGGCTCACGT 
SEQIDN0424 

AGGATACAAAANCGAANCCNNTGNGTGNCTACACTGCNGAACTGCGTCGTTGCAGGGTCTTA 
TTGGGCTCAGT 

SEQIDN0425 

NAACAATTTGAAATAATATATTTCGTCAATGCAGCTTGCAAGCTGCAGAGAGGAGAGTCATT 
ATAGT7\ACTTTATAACTTTTGTTTCAGTTTACAAACCTTGTAAATTTTGACCATATTGAAGT 
TCTCCCTTCAG 

SEQIDN0426 

GCACATTNTCACATCTTTACTAANATAAG7\AGATTNCTGTANCATCTACTAAGATATTGCAN 
AATNNTATCAGCNAGAGTGTTGACGCCGC 

SEQIDN0427 

GAGACTTTCAATTGCGTGCNTGCTNTNANCAAGCCGCGAGACANTNCTAATACTNNGACNNG 
CTGGNAATGNGNCATCTNGNNNNNCTANTNAGANNCNNANGCNCACAANGTNNACTGTGTCC 
TTCTGGCTGATGNCTTCCCNAGCATTACGTGNTGTCTGCGGCCTGAATAAGATACTGCCTCT 
GCAAATCC 

SEQIDN0428 

CCTGCTNATTGGANGGAGCACTGAGGGTGGTACTNNTTGCAGGAAAATGCCTGTCNTNNGNA 
CNCAANTNCANGCCCGNNNNGCACGANGTNGATGCGGNACNANNGC1S1GCNTNATATCTGNNN 
NGNATCGNNANGTGTNACACGCNANNGA1SIAGCACCGGNTANNTNTTNNATCCTNTGCCGGTG 
TACCTTTGANNTNANANTCCTCNTGTTACCNGANGNCANGTGCTTCTNCTNAGCTTGNTANT 
TGAANTGGNGTGAGAATGAATGACCAGCNGCT 

SEQIDN0429 
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NAGNTNTAATTACTCGGNC 
SEQIDNO430 

TTTTANNCAGTATAANTNCCTNCCCCTTAAACCCCCCACTGGAC 
SEQIDN0431 

TGGNACTTTCTNCTCTTCAAAAGCTTTGACTCTCT 
SEQIDN0432 

TGTGCGCNNNGTGNATGTATATGTGGTCTNGGCTCTNAGNCTGNCT 
SEQIDN04 33 

TGCAGCTTGGGGAAGACCAGGATATGAGCGNCGGAGTGAGCCACTCCATAT 
SEQIDN04 34 

GACAACACCATCAGGTACAATGGCCAAGTCGCAGGCACTGGGAGACGTACCAAGTTAGGAT 
SEQIDN04 35 

TGAACTAGANATGTCATTCTATAGCNAGTATTCAGCCNGTGCTGTGTNTTANCATAATATNA 
AGAATNTTTCTNACTTACGTGCAGGGGAT 

SEQIDN0436 

NATAGAGGAGGACCCATCTGACTCCCGTCTTCTTCTTCATTAGAAATGGGAATCAACATCCA 
CGAACAAAAATGCTATCGCTAT 

SEQIDN04 37 

TCAGCCTCCCGGCTTTAACCTACTGNGGGNACAGNATGTNGGAAATNCCNGCNAAGCTGGNT 
GGNT 

SEQIDN0438 

GGNNATGCNCATTGGAAACACNCGAATGAAACGTTTCTNTGCGAAAGTACTCACCAACGAGT 
GCCATTGGAAAGATTTCTATATTGTTATGGAACGCCTAGANNNCAATACAGTGNNACGCAGC 
ATCT 

SEQIDN0439 

CATCTCGCAATGTNATCCAGNGTNAGCTAACNG 
SEQIDNO440 

TATAATCCNGCACTCNCAGGANCGCAAATAGNTGTGNNTGATGGTTATTNTNGTTATG 
SEQIDN04 41 

ACCNAAGATCCCCCNNTNAAACACCCAATCCCCCCTNTCCGGCAATGAAGCTGCCGGAGCTG 
ACATTGATCTGGCCGATGTTTTCGCCAAGTACTTGAACCAAGGTACAACAAATGATAATGAT 
CATGATCAAGATAATATTCTTCAAGAATCTCCCTTGGCTGATCAAGATTATTGTTCTATTGG 
AGCAAGCTTATCAAATTCTCCTTCATTAGATAGCTTGG 

SEQIDN04 42 

ACCANAACNCAANNGAAAGGGCCCCTACTTATAGNNNCCAAGGAGGAGNACAAGTTACTGAT 
TGG 
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SEOIDN0443 _ 
CTATTAAATACCTCCGGGTTTTAAANACCACNCGNCTATATTACCGGTTCCGAANCATTGTG 

CNG 

SEQIDN0444 

NCNCGGAAAGGCCCCCCTTNGTGGGGNAAACGACCCGGACTCTCNGGCNGCCC 
S EO I DN 04 4 5 

GTAAGGGTAAGGTCTTCGCTACAACTACAGTCGTTTGGTGGGTAACAACCATCAATAACATT 
ATCATCCTTCTCAATCTTAGCCG 

SEQIDN0446 

TCTNTGGNAAAGCCCNTGAGANATTGGGAAAACTNAACAAACNGNTAAGCAGCAGGAGANCC 
NACANGNNNAGNGAGGCCATTTTTTTNCGACANCNGNGATAACAAAAGGAAGCAGGNGGCAA 

ATTCGAGCTCAGACACNGAAAACCAGNNTCTNA 
SEQIDN0447 

ACTGGCNTNTGCNAGCGTTAGGTTGCTGGTTGTCCTTTNCTTTTNCACTATNNTTTTTGNGC 
TGTNNNTCTTCACCGTTTAGGGANCATTACCCAGTTNCAAAANCAGCTCNGTNACATCCGNC 



SEQIDN0448 

CTCGGCATAATCGTGCGTTATATCGCTGGTAGTCCGAAACATTCACAAGATTATTTTTCTGC 
TGATGCTCGGCATATTATATGGATTCTTTATTCATCGATATTGGCACTTGATATTTTCTGAG 

TCG 

SEQIDN04 4 9 
NCCGCAGAGTCCCTGCAGC 

SEQIDNO4 50 

GGTNTTGGANCTCCATTCTCTATTAGCCNG 

AAGGTGANGTCNCAAAGANNTGACCGGGGCCTGNNTNTGNTCNGNNNACAGGCATANCNGNA 
G AC N G AAG C G AN G AN G AC T N AAG 

SEQIDN0452 

CANGTGCAAGANTGTTCNTCGAATATTTTTGTATTATATANGCAAATAGTAACCCCACACCT 
ACTAGTTGTTTCTAATTTTCATTTTCTTTTCATTTGTTACTGTTTCGATTTTTTTCTTACCA 

TGTTGGATAAATAATGTGTTGACTATAA 

TCNNTGAGCTNNNTTGCAGCTTCTAGCNGANCTTTNTTTGCAGCGTCTNGCAGNNGNTTTNT 
NNGCCATGNTTGTTNTTCCTNTNCATAGCCCNGTGTATTTTTGGCTATGANCCTGCTCTAGT 

NTNCATCTGCCTTCAGCGTGAGCCTNGTCAACTACATTNTTCTTGGAA 
SEQIDN0454 

NTGNGGAGCATGAGTTTATTGCGTTTGATGGTTCACATGCTAAGTCTGAATACATTTACACC 
GTTTTAGATAACCTAGTCGGTCAAANACAACACATTACTATTTTTCCAGATGCTGATTCTTT 
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AGTTCTTGAGAATAGCTGAAAGTAATCAGAGTTTAGATATGCTGAACTTCCAATACAGCCTT 
AG 

SEQIDN04 55 

GATGGCAAAGCAACATTGNACAGGNTGAGGACTACTAGAATATTANANGCTNNTATTGGGTA 
GGNCATACGTTGGTNCTGTGAAAGGGAATCAATGCCNTGNTNTNNCTNGCNNGANNTNGAGC 
NTNNNGGNGCACAAATGNNCTATAANNAGCCCTNTNATGNAGGNGGAGNNCACAAGNGNAGG 
ANGTGATGCCNANCTGACCTAGCTTGTGTAACACAGGNTCATTGANAG 



SEQIDN0456 

CAAAGAGTGAGGAAAAATGGAAACTGATTGCGTGGTGCTACCGTTTCACACGGTATACATGA 
AAAGAAATCAAGTCAGGTATTTTGACAGTGAGGATTCATAACAAACAACAAGATCATATACT 
CTGAAGTAGCCGAATCCAGGGAGCTGTTGATCTGATCTCGATCCCCAGCAGCGTGCAGGTGA 
CTAACAAAGCTAAAAACCAACTCTATTCAAGAGCTGGAGGTGCTTCAACATAATAAGTAAGG 

GCTGTTCATTCTTGATTCTTTCAATTAG 
SEQIDN0457 

CTTTGCCACATTCTCGGCGNCACTNGTAAGTAG 
SEQIDN0458 

TGNGCANANAACANAGGACTNAGGCAAGCGNTANTATGGGGANNGGANCCNANGNGGCNCNT 
CAAGTGNANTC 

SEQIDN0459 

CCCCGATGCCTTCAGTAGACAGAAGCTCACTGCTGTTGCACCAATNTNCACCCCGATGACTC 
TGCCAGAGGGCGAACTAGTTGC 

SEQIDNO4 60 

NCCGGAGAAAAGGTCGAAACCGACCGTAGNTAGGACTNAGTTTCTCTTNCNGAAAGANCNTG 
ATCGGGCTCTAGNNCANAACCNNGGNTTTNAATATATAATAGANAAACTTCTTNNGNANGTT 

ATG 

SEQIDN0461 

TGAGGANAAAGAAGGNTACNGCNCTTNCCGATGNACACNCAGNAGGATGANCNATNNNACNG 
ACTCTCNATGCTGNNCGATGNCCAGAAGGTGAGCAACTGGAAGANTTTCTTCTGTTTTTNGT 
NCTTACATATNTGAANANNAATCANNNAAGTANGANCACTAAAACNAACCCATANTGGTCCA 

TAANCTNTNNNCCTN 
SEQIDN04 62 

GGGTAATTCAACAGTGTAGATTTTTTTCTAGCTTTTGTAGCAAATGAATTTTTTTGATCTGT 
TGTTGTACTGTATCCAAAAACAAAAATGTTGTTCAATGAAAGATGAAC 

SEQIDN04 63 

TCAANGAAGCTCTCACCAGTCTCCATTAGTAGAGTCTATAATTATGC 
SEQIDN04 64 

TTTGANNNCCCAANGAGNANCNCGNTGAAAAAGGNCCTGATGAATTCACCACCAATGCCTCA 
CAATCTTTGTGGNGGACTAAANTGTTTTTGCCTTTTNTTGAAAAAGCCTTTGCTCAGCG 
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^GGACCTGTTCCTTCTCGTTCCTGGAAAATGTTTGAAGACGAGAGCGTGC 
GGCTGCCNTCAGTCCACCCGGAGACCCAAGGTAGACCTG^ 

CTCCCTCTATCTCTATTACCTGTTTCATTTTCTTTCGTTCAAAAA^^^^^^^^^^^ 



ACATCTAMGACGGCAAAGTTCAAGAGACTTCAG^TCTGGCA^T 
GCGCATTTCTGAAACCCCTCMCGAGAAA^T^^ 

^Sttttwatngtggc^tgtanaagcgncaantgngatcggnncgaacgttctgag 

TGTANCTTCTTNWGCTCWTCNGNTGGNTGGGCAGTCTGKANTNATCAGCTGNCTTCC 

gaaSagaa^ggaagnatanggaagncgaaggagtgagcacaacgggagcagcatgnct 

CGN 

SEQIDN0471 ^,^«xirTraNTTTTGNATGTGAGTGCNTCTATCNTTTAGCZ^NT 

SSgcacgngcgggggagnggaggagkangtgnngggtt^cag^g 

ATGTTAGTTTA 

CATTATGCGGANTTACAGGATNANTACAACGACTNATCTGANA 
ATGCTGAAAAAGGAGAATAGAAGCCTTANAGCTCAAC 
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SEQIDN047 6 
TATCNCAANTACTGGAC 

SEQIDN0477 

NTTAGGTTAAGTACTTTATTTTGAAC 
SEQIDN0478 

CCAAGGAAAAAGGAAATCTTGATAAAGGACTTTTGGAATGTTGTTTGC' 



SEQIDN047 9 

AGNANCCCTGTTGTTTCATCGGATTCGGCTACTGCCTCATCAGAGTTGCTCTCAGATTTCGA 
CCGGAGTTTTCGTTTTCTGATGGATTTCA 

• SEQIDNO480 

TCGGNATATAATATCACCGCAAATGACCTCGACTCTCAAATGGCGACCTTGACCGCGAAACT 
ACAATGATTCAAACTCGAAAAATGCTCAATGATGTTCAACCTGCTTTA 

SEQIDN04 81 

GCCCCTTNAACAGCAGCAAAAAGGACAGCAGTCAATTCCCCTTTTCCCTAAGACTGCCAATG 
CCTAGTCAATCCATCATCTATCTAATCGGAAGCAGAAAATACCAAGGCTTCCAGAACACCAG 
AGCATTGTCACTGCAACTTGGTGGGCATTTTCCATTTAGAACTGACATCTGTTGAGTGAAAA 
TTTTATAGCGCACTCTTTGCACATCTTACTGGTCCAATAATGTTCTTCCAATTTGATGCTGT 
TTCTATGCTAATCCAAGACCTGTTTCCCGTCTCCT 

SEQIDN04 82 

AGACGCTGTNAAGTAATGAATTTCTTGAGGACGCTCATCGAAAGGACC 
SEQIDN04 83 

TGCCCTTTTNCCAGCCGTGTGTTGNTATTTTCGTCACAAAGNTTATCACAGGTCTCAAAGAT 
CACCAATNAAGAGC 

SEQIDN04 84 

TTCCCGCTNTANACGCCCTTATTCGAGTTTGAGGATCTGTCNAGGTCGAGTTTACGGCGAGT 
CAAGTTGTAATCTTGTTGTTTTGACAACGAGTCGATGTTTTTAGTCAAGTAACNCAATACCA 

AAGGAAATGGNC 
SEQIDN04 85 

CTANCGGNAAATCTCCTTCTTCACAAACGAAACCCTAGCAAAACTCCATCTNCATATCAGGN 
CGTTTCAACACTAGAGACCAAAGGAATGTCTCTTCAGCCAAGAGTCATGCCCTCCCATCCGT 

TCTGCTTCTTCACCATCTTCA 
SEQIDN0486 

CTCCTNCTTTTATTTTACCGNTAGCTGATATTGTTGCTTTGATTGGCTTTCTAAAAATTGTA 
AAATGCATATTTACGCTTGAATTTTCAGAGATGTATTTTGGGTGATTGCTTTGTTTATTTTG 

AGAAGTAGAGATATTGAATTCCACC 
SEQIDN04 87 



FIGURE 4 (continued) 



111/140 



AGGAAAATNGTGAGAGCAAAATAAATGAGAGAACGAGGAAGAAACAGATATGGATATGAGAA 
AACGATNCGCTTTTCTTCTTTCCCATTCACCTGAAACCAAAAACACCTCTCTCATTTTAGCT 

ACTGAAACAATCACCAAATGTCACCTAAACAACCAGAAAACCTCCA 
SEQIDN04 88 

ANNCCCNNTTTGAGGGANNNNGGCTGGGNCTGATGNGTGTGATGCTACGNACTTANGANNCN 
ATGCNGAAAAAANGTATATCTACGTNGGANGGCCNTTGNTNCCTGGNGGCGNAGATGNCGCN 
ATTTGTACTTAGACACATTTCAAAGCATGTTGGCNAANGGAGATTGNGAAANTNTTGNTGTN 

AAANTTAGTCNTNAGNGTTACC 



SEQIDN0489 

TNCCCGGTTNGTTAAGNGACTTCAGCTCTGGCATCTGTAAGTGGGATATCAAAGCGCATTTC 
TGAAACCCCTCAACGAGAAAATAGAAGGACTAGAACAAAGCTTGACCAACCATTCTGTAGAA 
GCGACACAGAAAAGGGTGAATTGCTATCACACTCAGGAAATTTCGAATCAATAAACGAGAAT 

GGAAACAGAACATGTTCCCGTACTGTATTTTCTCCTTTCAGC 

SEQIDNO490 _„„ mm ,, 
TCAANTGANAGGTGTGGGAAGAAATGAAGAATTGTTGATGGCTTATTTTGGGAAAAGCCTTA 

CAGGAGTAGCTTCCGAATGGTTTATGGATCAAGACACGTCTTGTCAAACAGTTCCAATACAA 

CATTGACATTGCCCCAGACCGCAATTCCCTTTCAAACTTGAAGAAGAAACCAACTGAAAGTT 

TCAGGGAATATGCCA 



SEQIDN04 91 

GGAANCGGGAATTCTTGATAAAGGGACTTTTGGGAATGGTTGGTTTGGC 

SEQIDN04 92 ^ 
GCNTTNCGGAATTCCTCTCTCTATATGAGACTGAAAGACTATGTTCAGGAACTTGCTAAATT 

T G AGAT T G AT AC AC AC AACATTAT AA 

SEQIDN04 93 m „„ 
NNCTGGTAAGAAATAGATGGTGACAGAAAANNTTTNNGGNGTTACGNTNGANGATTCATTAN 

GGGNGANAAANACCAATTTTCCTTTGNCTTCTGTANTAA 
SEQIDN04 94 

GGGCCTTTAGGGAAGGATGCTTTGTTGGCTTATGGTTATGA 

SEQIDN04 95 mrTim 
TTNNCTCCANTACGGAAACAAGCACCGGCTACCGAGGACTCCNATATGACACGAGAACTTTT 

CAGGTTTGGCGCCCGTT 
SEQIDN04 96 

GGTATGGAAGAGCTCANNCNAAACGNGAGGAANTTTNNGGAAAACAATATGGAGCNTCAACA 
TGGATAGGAAATGTCAAANGCTTGGGCGCT 

SEQIDNC-4 97 

GTCCGAACACCAAGAGAGAAACCCAGTGCCAATGGAGTTCAATTTTCATACTGAAAAGAGGA 
TTCATCATAATCCGCAATTGATCTGT 

SEQIDN04 98 
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GAAATACACNATTTCNAGCTGNNCCCTNGAATGGATGCCAANNNTGCTAATGCTNGNCCAAT 
GACNGTATCGANAANANGTCGCACACNAGAATTGAGGCTNACAGGGATATGATTACACCTGT 

TGGAGACGCTT 
SEQIDN04 99 

GAACANTGATGNTTTCCTCNNGGNNGGCTAAGGNNTNCNCCNACCCNGACAGGGCNTGGATT 
NNGGTTCTTNTTTCNNCGNGTCCCNNNNAATCTGACTTTGACTACTAAGAATTNCATACGNG 

TGGGGT 



SEQIDNO500 

TTATGTTTCTTGAGTGTTTTCTGTCTGTGAAGGTTTAGCTCACACCAAGTTTTCTTTTCATT 
TGCTAACACCAATGTTCCCACTGAAATGTGGGACAAAAGTAGGAAGCAAAGGGTGAGAGCTG 

CT 

SEQIDN0501 

GCGCCTTTGTNTATAATGCACCTTTTTTCTTCTGAAAATATNCTCCTGATGATCTTGCTTTG 
GCNCTATGAATTCATTATTGTTTGffGNTGAATTGGCTAAACCTAGGGGTACCAACTTTTTAT 

TCCTGAAGTGGTGGAACATTTACCTATCTTGTTT 
SEQIDNO502 

ACTAANCNNNCCCCATAACTNCGNTTAATNTACATCAAACCTGTACTCTCTCCATGTAATGN 
GGTTGTNAGATCACTGTTCTCTATACGAGGCTCATTACATACCGAATATACGACCCTCTTGN 

TTCTCTTTTGGCTGT 
SEQIDNO503 

NACNGCGAGNGATACTCCNAAACNGNAAAAGAACTCCGGAACACGCNTGGAGCANGAGATTT 
TTTTGAGCACACAAGGCGGAGCCAAGCTCTAACAGNCNGCANGAAGGAAGNGATGCATGGTG 
AGAGTACAGGCGAGAACACATGACATCTNTAACATACTCTCACATAANCTNGAAACTGACGT 

GTNNNACAGAACTNAATGCT 
SEQIDNO504 

NATCCTCCCNTCNAAAAGCCCGGGTTGCCAGGGNTTGACGTCTGACCGATTTGCAGAAGTAT 
CATTGAATGTTGCTCGTCATATATCTGCAGACTTGGAGAGGNTTTACCGCAATGTGGGGGGT 
CAGCCGCAGGAACAAGCGCCTTGATTACAGTGATGCTGGTGGATTCTACTGCAGAGATCAAA 
GTCTTCTTTAGCTAGCAGTCCTTTTGATTATTCTTTTGTTATCTTTGAGTTTGTAAGAGTCT 

NCTGNTGTTTTGATCATGNTATTTTGCCTTTTATTT 
SEQIDNO505 

TTNTGGAGAAAGGNGTGTAATGNACATTGTGTGTANGCACAACATGGATTTTGT 
SEQIDNO506 

ACCTGGTTGTTCCGANCCACCAAGAGAGANNCCACAGTGCCNNNGGAGTCCANTTTTNATAC 
TGAAAAGAGGATNCATCATAATCCGCCAATTGATCTGT 

SEQIDNO507 

TGTGGCAAAACATGTAAGCGAGCAGCTAATCAACAAGCTTGATTCGGAGATAGAAGCCGCTG 
AAAAAGCTCATGAAGATGAACCATGACATAGCTCAAAGATTACTTAGATATAGTAGTTCAAC 
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CTTACTAATTTTTGTTGCATAGTGCAAATAGACTTCTTGAATGCTTTGTAGAGGTGAACCCA 
AACTTGTCATATCAATTCTATAGTG 

SEQIDNO508 

AGGCNCTGCTNCTGGGTCCNACTNTGCTACACAAGNAANAAAANAGCAAGCTCTCGTTGGTT 
TNCTCT 

SEQIDNO509 

GGNTCGGAAATCNCGGATGNAAGNCCCCAAGNCGNANGATNNNANGCGCAGGGGTATAGNAT 
GANANNCCTATGCTATANGGAGCTACAGTAGGCNAGNTTATTGAGGCCTGACATTNCC 



SEQIDNO510 

GNCNCGGTTTNNGCTCCGCNATTGATCGTTACTGTGACTAGACAGAAACCTGNANGTCTTCA 
NACTTTNACAAAAGGAANGNGCTGACAAGGCAACAGGCCTTCCATCCTATGATCACGNAGAA 

TCAACTNTTGGAGCATTTGACAACATTGCGCTATAGCC 
SEQIDN0511 

AANCCCTACTTTATACATGANGTNTGTGAATACTTGTAANGGAAGNATNNNGANNAGNTTGG 
GATGCNAANGTATGTTCTGGTGTTATGCATNCTNCNANTGCTCTTGCTGAAATCCACAACTA 
NAATANTACTTGCACTACATTANGGCTGTNNTTANNCAATNANTAGTTTTTTGCTGATTTGC 

ANCTCCATGTATNGATAGCNGAGNGTNGACAATCNANNATTCCT 
SEQIDN0512 

NANCCCNCTGTAAGCTCNCTNAGGACTAGTNTAAAGGGGGGCAAACANCTGATGAATGCCAA 
CTGAGAT 

SEQIDN0513 

NNCTTTTTTNGTGNNNCATATTNATGTTTNTATNACAAAAGANNTGTNTAA 
SEQIDN0514 

GCCCCGATNTTTTAGGGNNAAACTCTGCATTTNTGAANGGAATGANGTCTATACGCATTGA 
SEQIDN0515 

ATNCNACNNTTGCNATGCNTNGTNCNGGGACTTGAAGCCNNGCAATCNNCTGNGGAATGCCA 
GCTNNGAT 

SEQIDN0516 

CCNGGANGNAGACCCNCTGNTGGCATCAGGNTATACTAGCNTCAACTAGGGAGTGGAGACCC 
TATNTTGACA 

SEQIDN0517 

TCCTNATNTTAGCGGCCNGNNTGCNGTTCTGGTCANTGATGCNACTNTCGGNCNAATATNNT 
GATGNGTGCGACANNGGGA 

SEQIDN0518 

ATGTNCCGANNTTGTTATCCTNGCATGATNTANGGGAATGATNCTCTNNTGTAAATCAAGGT 
GCCGTAGGTAGTTNAGGGACANTNTATATAACATGCNGATATGNGTGTGAT 

SEQIDN0519 
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GCCGCTNGTATTNATCTGTTGAAGAAATTGCTGNTCAGTTTGTTCTGCAGCAGTATGACAAT 
CCACTTTCTAAGAAGCTCAACGATATCA 

SEQIDNO520 

CCTAACTNTAAGGCCGGCAAGTTCAAGACCAGTTTAGCAGACACTTCCAGAAAATCGCTTGA 
TGGGTGAAACTGAGAAGTGAGGCTTACAAGGCAAACCATTTTGCCATACA 

■ 

SEQIDN0521 

CCCTCTNTNATGNCCCNNAGCTGCTGTGTTAAAAATAGAGNCCAAGAGCTCATAAGAATNAT 
GTCCGAGGAAGGATTATACTGTGNCAAACAAATCNATANNTTCATNGTATATNGNGNGGGGN 
ANCAGTGCANCAAGTGTGGGGANTGGTTGCTGGAAAATATAGGATCAGA 

SEQIDN0522 

NNTAACAACCCATGNTNTANGCACAACAAGTGGAGCATATNCTAAAAGTTCCGGNGAAGAAC 
TTGAGAAGGAAAGAGAAAGAATGGTACCGAAATGGAGAGCGAGNGGATTT 

SEQIDN0523 

ATGCNNCTTGNNGTAACCTGCCCGACATTTATGCCNTCTNGNTTATGNTTGATGTTGCGTAT 
TCAAGTTATTGACATTTGGCTGAACAATTAGTTCAAGTTATTAGTTAGTATCTAGTATG 

SEQIDN0524 

TGTGCACATGNCTGATNGTGCTTGNTGGNTGTGGNTAAGGATATCGNNGAGCTAGNAGNACC 
NTACTTNGANCCGCTGNCATGATGGTTCGNTNGTNCNNGCTGCTGAGGNAAGACACTGTGTC 
NGCGGGACNCAACTCTCCAGCGCTTTATNAATG 

SEQIDN0525; 

TAAGGGCTGCTGAACACATCACCAATGACTCACAATCTTTGTGGCGGACTAAATTGTTTTTG 
TTTTTCACTGAAAAAGCCTTTGCTCACCG 

SEQIDN0526 

AANTCCCCCTGTAAAACGCCGCGCCAAAACTGGGGANAAAGAGCGGNCCAGCNNCCGATCCA 
NCGNTGAANNNACNGGNNGNGNCANNANNACNNGAGGGNANTTTNNAGG 

SEQIDN0527 

TCTCCAGAATCCTCATCAATGCTCAGTATGTATTAGTTCTTAGTGCCATTTTTTGAGAATGG 
CCAGNTTCAATGTAGGGTATAATTTATTGGCTCTTTTGGTTTGGCATTTGTGG 

SEQIDN0528 

AACGGGACCTTCGATCCAGACCTCAGAAACTCGCCGGAACCGTGACAAAATCCAACAACAAC 
NAACGGCTGAAGCTCTCCTTTCAGAAGTGTCGCTGCTGGTTGTTTTCAGTGAAGCAGGGGTC 

ATTGGTTTGG 
SEQIDN0529 

NTGAGCNCAATTNCTGCCAAGGNCNGNACGGNCGATGNTGAACTGAGNCCNAGAGGNAGCNN 
GCACTTACCCTTATNTNGGGGANGNNGAGGTATACAAGGTATTTTAGTATGGTATTCTTTGG 

AATCATTTCCGCTCNGNCCTAGTTTGTTGNTTCCTG 
SEQIDNO530 

CGTTGGAAANCCGTGANGNNTNGGGANANNNNNNCCANAANAAGTCGCCTAGAGGNGACCGA 
NCGNGTAANCAACCTTT 
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SEQIDN0531 

ACGCNNCTNGTNNATNAGCCACTGAACCNAAANNNTNANCTCCGCACGATGCTGACGGCGAC 
GGNTACG 

SEQIDN0532 

TCTTTNGAAAGNCCCTTGCATTTTNGNANAGGNNNCTTTNGCTTAGNCTTAGCAAGCTGNTG 
GGGAGAGTGGTCAANTNTTTNGNCAACANCTNAGCATNCACATGC 

SEQIDN0533 

ANTCCCCTGTNTTCTTGNTCACCNGTGTGGAGGNTGNACTGCTNCNTGGACAGGNCACAGTG 
GNGGACTGACNGTTGNNACAGCCNTATTGNGAGCG 

SEQIDN0534 

TAGCAAGGAAAGGGCTCTAATTCTTGCTCGACTCCTTGGGCGGCNTA 
SEQIDN0535 

AAATCNCCGATNNCNAATACCNAAGGAACATCAACAAANGACNTCTTACTATGAATCTTTTG 

TTTGATGTTTAGAGCTTATTTATTCTTATGATGTTGATGATGATNCTTTAGGCATCAAACTT 

CATACTTATATCTTTGTTATTGTATCTGGATGTTCAACTTCTAAGTGTTATGTTGTTTTTTA 
GTCTTTGAG 

SEQIDN0536 

NANCCCCCNTCNAACAAACCCNTGCTGTACCCATTTNACCGNTTGCAAAAGACATGAGCCTG 
NNGGAAAAAATTTACGATTCTATCCTTGTGATGGTG7y\AGTNTTNATTTATGATAAATCTAC 
CACTTTTGATTGGATTTCACGATCCAAAATAAAGGATGGTGTTGCATACTATAAGATTTTAG 
TTTGGAGATCGGTTTCCCTCTTGATC 

SEQIDN0537 

NACCCCAATNNAACAAGCCCGGGTACCGAGNNTCCNATATGATCGAGAACTTTTCAGGTTTG 

GTGCCCGANNTTAGGTTNCTCTTCTGTCTCGGC7\ATGGCTTTAATGGCCTTCAGTGCCAGAT 

CAAATTCCTCATCTTCACATATCATTCCGATTACTGGCCCATTGATGTGAGTAGGCAGAGAA 

TTGTTCATCATATTANGGGCCTCTTCATCCCTAAACATTATTTCCTTGACATTGATAAGGTC 
TTCCACTGCTCTC 

SEQIDN0538 

TTATNTATATTGTTAGACNTTGGAGTCTGAAATTAGNGNTGTTTGGGNTGTACGC 
SEQIDN0539 

GNAGAAATCNAATCNAAGTAGAGGAAGGGCGATACTGGGAAGGGGGGCCTTAGCN 
SEQIDNO540 

GGGNATGTCAAGTANGACANTATGGNCGANNCTNGAGCGTGCACNATGTCTATTNCAGCANC 
ACATTGANGATANCTGAGGANTGTCGCCAC 

SEQIDN0541 

AAGTTNNGCGANTATCCTTCGCTGAGTNTAAATCTATACANTCTTGAATCCTNATTACACTG 

TTAGAGAGATNATGAAAAAAGGACCTNTGAATCNAANNNCCTACTATTTTGCTTCGCCTTTA 
CC 

SEQIDN0542 
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CNNTNNACATTTAACAAGTGAGAGTTTGAAGCCCTTTCAACTTGCGCATGTGAAAGCATTGA 
ATCTTGCAAAAGGGGAATGCAGGATGATAAAGAAACTCTGNCATCTGTATGGAATAAAGCTA 
TTGAAATGTGCGAATCCAATTCACTTGCAANCTTTTTGAGAAGACAAGGGAAGTTGTCATCA 

ATTCGTC 
SEQIDN0543 

CNCTTTGAAGNNCCACCATCGTACANGGGANAANACGGCNACCANAATCCGGNCAAATTCNG 
GNGNNCTNCCNGAACNCNTNTTTTTNTTTGGGTGCCACCATCGNACCGGNCAC 

SEQIDN0544;BSTC4-34-185 

CTGNCAAACCCNGGGNAGTCAGGNAAACGTCCANCATGGATCTGGATCNNGGCACAGNGAAG 
GCAACGCNANCGACNTAGNNACNNNANGACTGTATNAAACANAGNCNGGANTNATACTGANN 

NCANNNANNAGNNTANGAAGNTTCANGGCNC 
SEQIDN0545 

TTGNNGTNGNAGGNGGAACGNAGGGCAGTTTNNTTCCNAGGGANCACCANCNANNNNGNTNN 

TNNNNNAANNTTTTTTGNTATANNCACACGGANNTNNNNACNANCGAGGGGGGNTTTTTTCT 

ACANTNNATTNCGTGGGNNANAATCAAACGATGANNNCNGNGNNTNCNGNGGANATGNNCGA 

CNNGNNTANNGNTCGACCNCNACCACNNNACNGGAGNNGNNGANNGTCGNNNCTCATTAANG 

AGAGNTTAANCNGAGTGNAGTNAATNACGNCNANANNGANATNTANNTTTTNNNCNNGGNCN ■ 

NNTANNTANNNTNACNTANNACNNNNGTATNNTNCGGNNGCNTTCCCANNNNNNTNTANNNC 

TNNNTCGAATAAGANNCNCGGNCANGNNCNANTCCCNGCTNNNCAAAACACGNNAGNGGAGG 

GTCCGCGNAGGCAGTGAATCCCGTGATTNANCTACAAGTGCCTTGNGTGCAGNTGNCAANAA 

CAGGAAATACTTNTGGAATAAGTGATGCATNCAGAAATGCTACTTCTGGCTCCAAAGTTGCT 

GACTGCA 
SEQIDN0546 

GAGGTACATAGCAGCTACCAGGCAGTGATTCAAAGTAGAGCTGCATTCCGTTCGTAGGCCTT 
AGAATAACTTCCTAGTTCCTATATACTGTTTCCATTTTATTTCAGACAGTATTGTAATTCCT 
TTCCAAATATTGTATTTAGTATAATCCCGAAGCTCATGTACTTGTGACTTCACATATTGGGA 
TATTCGCGTTAGATGTTGGTTTTAGACTTATTGTGTTTGTATCAGAATTGCCTTTACGTTTT 

GTTA 

4 

SEQIDN0547 

CTCTTTGGAAAGCCCTCATNGNGTGAGAANACNANGCGGNAAANNCTNTTGNNACGCCNATT 
ACTCAGGACNCATCATTTTTTTCNNNNNCACGCTANAAGGGGGACTATNNGGCCTAAGGANA 

TNCAGGNGGNNANGCGTANTACGGGAGAAAGGGC 
SEQIDN054 8 

CCTTTNGAGGCGGCATGGATGTAGCAGGGAAAGGCTCTAATTCTTGCTCGACTCCTTGGGCG 
GCNTA 

SEQIDN054 9 

CCNTTGNTGAGCCTATCTNNGTTCCGAAANTGAAACCGACGCTAACTTTCTCCACTAGTCNG 
CCTTTCAGTA 

SEQIDNO550 

CTTTGGNAAGACCGCGAAGTTGAAGGACAGGGAGAGATGANGNGCGNCTCCTTAGGGNACGA 
TCCCTANGNCNNACCGCNNTCACACAGNGTNTGGGGTA 
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SEQIDN0551 

GTTNATNATGCGATTCTTTTTCTGCCTANGGTGGNAGNGACCAAGGAATTGCAGGACCAATT 
TTTTTTGGGTTATNTATCCCTGCTCTAAGGGCACTTCATTGGTATAGGTTGNAAGTGTAAGG 
NNTATTGTTGGCTGGCTA 



SEQIDN0552 

GAANNCCCNGACNNATTTGGGAAAACCACCTGANG7^AGAAANGATATGTNGCATNTAAAGNT 
GACTTATGAGTANNAGGCTANGATNTGTTACCANACCCGNGNTGGTAATCNNAGNACTATAT 
NGAACATNTTANTTGNACCTTCTNANTACATNANCNGNTATGAGNACCANTATTACNCNGNA 
CTTNATTNACANNTGCGNNGNNAGGANATTANNGGTGNCNCTNGATCGANTTCTGACTCATA 
NTNTNACNNANCNAATGNACNNNTCNAANGTNNTNANATNATNNNTCNCGTGAATCGAGNTT 
TAGCTATNGCNGCNNACCACGTG7\AGAAGAAATGATTTGTTGCACGTAAAGCTGACTTATGA 
GACNGAGGTTATGATATGTTACCATACCCGAGTTTGTAATCTTCGCACTATATTGAACATCT 
AGTTGTAGCAGTTTTTTTTATCATCTGCTATTTGTGCATTA 

SEQIDN0553 

TNAAAAANAATGGATAGCACTAACACAAAGGCGGCAAGTTCAAGACCAGCTTTGCAGACACT 
TCCAGAAAATCGCTTGATGGGTGAAACTGAGAAGTGAGGCTTACAAGGCAAACCATTTTGCC 
ATACA 

SEQIDN0554 

TGCCCNGTGCNTGGTTGTGGCNAGNGNGCTAGANGANTCCNGANGAGGNGNAGACCGNGAAA 
CCCACCGA 

SEQIDN0555 

CTTTGGAAGGGCCNNAAGCTNNNGGGANTCNGCNATAGGGGAATNAACCNATGTGCATGCAA 
C AAAC AAG CCGNTNNATGT CAN G A 

SEQIDN0556 

CTTCGTNNAGANCAGGGATTGTTGNTTTCCAGCGNACGATTCGAGGTTCGGATTNGGNATTT 
CGATGTCTCANTCCANGGGATTGTTGCTTTGTTTAGCCCGA 

SEQIDN0557 

CNTGNCNTNCCGCNGCTCCTNCNGTGANNNCNGCTGCTNTACGGAGCTGATNCTGTNNNTGT 
CAAGGAGGNCGACACAGGTANGNNCCNNCGNGAAAGTGTGTANATGACAATATCAAGATTGT 
NNGGAGA 

SEQIDN0558 

CTTTGGANANTCCGAAGAGATNAGGNAGACGACCCTGATCCTGNAGGCTGAGCAAGAANNNA 
GNNCACAATGAGCNATNGCTANGNNAGCNGACANGCAAACTANCTCNNAANCTNTNCTGGTG 
ATNCCGNTGATCANGGAGGNAGCCTTCNACCAGACAGNCNTGACAGGA 
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SEQIDN0559 

CNCTTNGNCACAGCCCTATTTGTATTTATGTTTGAATTTTATGACAAAATGGTCGTATTTTT 
CTCA 

SEQIDNO560 

ACGACGCGTANAANATCTGAAGGATACCTATGNNCAANCGAACCAATGCACGGATATCCNTT 
TATAACCCAAATCTTCAGTNGNGAATATCTCTNCAGTTCCTTTTCTATTGC 

SEQIDN0561 

CGAGTTTNATGGCGNNGCGATGTGGACATTCGTTGTGGNGGCCTAATGCTGAAAAGGGNTAT 
TGATATGGCAAGAGGAACCCTCTGCAATGCAGAAATTGANNGTGGCTCC 



SEQIDN0562" 

ACNTGACTTCNTNCAACCAGCCATCTATANNANAGGAAAAATANTNTGAGGATTCCCA 
SEQIDN0563 

CTGTCCNTTTTNTGNGACCTNGTGCNGGCNTNCTCTGANNGNGCCCNGTNAGCGNCCAACTC 
NNATCAAGCTCCTTNCAANTGANTGAGGACATGATGNGGTNATTTACTCGTGANGAAAGGCA 
GCTNATTCCTGACCCNATGGAAGCAGNNAGGAAATCNGCTCCTNGCTNCNNACTGNGCANGG 
NTNNANNGTACTCGNCCATACNGANGTCNCACANNATTGCTANATTGTTNCTAGCA 

SEQIDN0564 

NCTGGCAGTACCAAAGGTCCTATGGATTGTTACTTCNCGCAAAAATCTGGAGATAAGGAAGG 
AAAAAGTGGTAATCCTCAAATTGATGCCAAANCGATTTTGAGGGATCGTGCAATTACAATGT 

TTGCGCGGTGGATGTATGATGCAGGTCTTCCTT 
SEQIDN0565 

GNNGCTAGCAGCTCGGAGTNTNTTGNGGTCCNGCNGAAATTTTTNNTGGNGNACACNGGAAN 
TTGNNNNATNTCTNATGGNGTATGGTAAGAACTNATTTTTTGANATTGANGGNCGANATGTT 
CTTNGGGGGGGNNCCGTCACACCTGTCACTTCATTTCATTTTT 

SEQIDN0566 

AAAGGAAACTAGTTGGAACTTGTT 
SEQIDN0567 

AAG AT GAT G AGC AGATTGCAAGGAGGAAGCACC T 
SEQIDN0568 

ACTACGACTGGCAAAGATCAAGTTGTAGTAACTAATAAATACTCGAGAGAGAACAGTGGAAA 
TCTTTTTGT 

SEQIDN0569 

TNCANGNCGCTGCNCANGTTCCTNGGNAAACAGGCCGNCTTGGGTTGTACTCAGGTACTCAT 
GAAACTTGTATAGNGCTNGTAAGAAGTTTGNGTNGTTCGGT 

SEQIDNO57 0 

CTGGCNGNACCAAAGGNCCTGNAGCCCGGTANATTCNCCCCTGTAGCTNCANACTTCCTGAN 
TNTACTNTNGATNNNACATTATGGGGNNAGACCACNATNTNNATNNTCNTCAGCTNGTGACT 
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TCATGAGNTNTCTTGGCCATGNNAAGCTAAGACATCAATATGTGAGNGCGNTCACGAGCATA 
TGCCNGAGCAGACATTCATAGAGACTCTNTTATTAGTGG 

SEQIDN0571 

TTTCTGCTGNNTCAGTGAGGTTAGATCGTAATGGAGCACTTTTTATGGAGAACATCAAACAA 
GAAGTTGAAAGTATTGATGCTGATGTAACACCTTCTCGAATACAAACTGCCT 

« 

SEQIDN0572 

TTTGCATGNCTTCGAAGGNCAGTGCTTGNTCTGAACCCGTNNCTTGGACTTGACAACTAGCA 
TCTTCTCTTTGCATGCTGCCCTCATGTATTGCCAATGTAATTTCTCCTCTAGCAAACCATTA 
TGTATTACAAACTATTATTATGATTGTGAATAACTTGTGAAAAGTTCAATCAATCTGAAAGA 
AAGTAATCTCTCT 



SEQIDN0573 

GGNTTCTAATTTCTAAGTTGATGGCTCAACCAAAGATATTTAGTACTGAACTGATTGTACTA 
ATTGTTCTATAAAATTACGGGGTTTAGAT 

SEQIDN0574 

ATATGATGNAGTCCGGAAGATCGAATNTGGGGAAGGTCCTTCTGGGATCAAATAGGCTAGAT 
TTACTTGTTTTTCCTAAAAATGTAATAAGGCCAAGTGCCAGTAGTGACTTATTTTATTATTT 
TAGTGTCGTTTTGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCAT 

S.EQIDN057.5 

TTTAGCTACGANGGTTCTCTNCGAGATTATATTCTCAACGCTNATGCNCACGCNTTTGCTTC 
TCGTGT 

SEQIDN0576 

CAGNGCTNGAGCTGAACCCGGNGCTNGGACTTGACAACTAGCATCNTCTCTTTGCATGCTGC 
CCTCATNTATTGCCAATGTAATTTCTCCNNTAGCANANCATNATGNNNTACAAACTATTATT 
ATGATNGTGAANAACTCAGTGAAACGTTCAANCAATCTGAAAGAAAGTAATCTCTCTTTCCT 

SEQIDN0577 

GGGAGGNAAANTNGCCCTGAAACNTAAGAGGCTGAGACTTGTCATAAAGAAACAAACTNTAT 
TCANGCANGAGAAGAAAGCAGTAAGGAAAATCAGCAAAATAGCAATGAGGTTAGCAAAGTTA 
TTGATAACAACGGAGGGACCAAGGATGTACAACACAAGAAGGAGAACATGAACAAAAGAGCT 
AT G AC C AC T GG AAAAAT T GAGC AG AT CATG 

SEQIDN0578 

ATACGAAGGTTCAGTGCTAGTAGCTGAACCCCGTTGCTTGGGAATTGATAGTTTGGGTGACA 
G 

SEQIDN0579 

AG7\AATCTGCCNGGTTTGCATGGATATAAGCAAATGCTCAAGAATGGTGCTTGGGAACAGTG 
CATGTCTGCCTTGGAGCCCTCTGTGAAGGGCAAGCTG 

SEQIDNO580 

GCTCAAGGGGAGGTGGCNCCAGAGNNNAGTGCGNGGTTGGGGNAAAGGGTGCAGATTCTNCN 
AAGGNCCGTCAAAAGAGCCCATGNCCAACAATTTACTAATGATTCACAAGAGCNNTGGGGGN 
GG 
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SEQIDN0581 

NATGTGTATTCCTGAANNANCTNANTGNNCAATTATTCAACCANTNATTNTACCAAGTTCAN 
TGTTANCCAGANTANNCNTCATTNATCTNTNTACATGCNTCACTAAGATNTTATTTGTAACA 

AGNGGTTTTGTTGGNTGG 
SEQIDN0582 

GCTCTAAAACCAACCTTTATCAGTCAGAAATCAGCTTTCAAACTCCATAAACACAGCAGTTT 
GGTTTTCTTCACCATCGATTCTATTTTCCGGTCGCGGTTCGTCACATTTTTTGAGTTCAAAG 
CTATCAAACAATTGAATTTTAGACTTATTTGAGGTTTATTTCTCCCTTTCCGCTATTATTTT 

TGG 



SEQIDN0583 

TCTATCACAATAGAGTCCTTTGCTCGGNGAAGAGATGGGGCACATCAAGCCACATTATCCAC 
TCATCACAATAGTAGAGGCCACACACAGAGAGGAAATGCGCCAAAAGGAGCTGCTGTATATA 
CACATCAAAGTTATAGGAAGCATGCATCAGGAAGAGGAAATGGGCTTGTTGGAGCTGCTCTA 
CATTCTCGTCAGAATAATATGGGCATGGGCAGAGGACAAGTGCCAAATGGTGTTCCTCAACT 
CAATCATCGCAATGTGGGAGGTCAATTTCGCGGACGCGAAGCAAAGAATTCCCATTTGG 

SEQIDN0584 

• NGGAAGGNGTTTGGTGTANGGGTGGGGGGATTGGGGTGGACCCCAGGTGGGGGGG 
SEQIDN0585 

AAANGAGGAAAANTAATNTATGGCTANNNACANATGACAAGGACATAAGGTAACTNNGCATT 
CTANCC 

SEQIDN0586 

TTTGGCTGCCNTTTGCTAATCCNTTGCAGTNTNTGTGCATAATNNGAGTAGGGGTATGAAGA 
TGCCCACCTNTTGTTCATTCACTTNAAGGATAATTACAAGCCAACTATGGAATGTGACG 

SEQIDN0587 

AGANTAGAATGTTGTAAGAGTATTGAACTCAAAGCAGTATTGTAAGTTTGTAAGTAAGTTGA 
AAGTATTGAACTAAAGGCTCGAGGTATTCAACTCGAAGTAGTGGTGTAAAAGTATTGAATTG 

GAAGTGCG 
SEQIDN0588 

CCCCTCATTTCACACATTCTTGAAACCAGGTGCACTTGCCCAAATCAGGTACTCCAAAATCT 
CTGCTAAATCAAGGTTGAAAACTGTTCAGTCCCTGTTTGCTATTTATAGTCAACAAATTCCC 

SEQIDN0589 

TTTGGGAANCTTTTGGAAGGTCCCATATACGTNTTTNNCAANTNANCCNGGGGCCTTCTNGG 
TTTTTTTTATGNTTTNATACGTCGGNTTGAGAANATTGNNTGNTTACAAGNAGGTGAGGAAT 

ANATAATATGATTCCTTATCTTCTTTGCG 
SEQIDNO590 

CAGCGGAATGCCACCGAGGCGATACCAGCGATGCTGCACGTGATGGCATGNTCTGCTTCGGC 
GCGACCTGGGGCAGAATAGAGGAATCCGGTATAGCGCTTCTCGCCCAACCGGTACTGTGCGA 

GGAAGCTAATCTTATTCATGCCG 
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SEQIDN0591 

GGCCNGTAGTTGGGCTNGNNACGCNCCNNAGNACCNACTGGCCCNGNNAANGAGNATNAGNT 
NNTCATGCNTTATACNGGNACTNACAACCCACCANCCATGCCATAGCAAAGAAGCGAGNTAT 
AAACACAAGNTCNGGACCTNTGCCTATNCCAATCAAAATTTACAAAGCCACGGNTACAAACT 

NCTAAACG 
SEQIDN0592 

TTTNNNCCTTTCNTNNCATGNTATACGAAGGTCAGTGCTAGAGCTGAACCCGTTGCTTGGAA 
TTGATAGTTTGGGTGACAG 

SEQIDN0593 

CCAGACTCGCGATANCTGNNTNANCTAACANTAGCATTNTGANGANGTACCTGNGACTTNCA 
CATAGCAGCGGTGGGTCGAACAG 

SEQIDN0594 

ATCTATTGGATTTATGCTTTGGNTTCTGCTTCTAAAATATAGAAATTCTGGAGAATTGAAGC 
TCGTTTCTATTCGAGGTTGC7\ATTCCAGNTCGAAATCATGGNCCATAGCTCGCTCGAGGATT 
GCTTTTTCTTTGGAGATTATTTTGCACTGNACCCGTTGAAAAATTTTCAGNAAC7^AAGGTCC 
ATCTTCCCCATTGCAACTTCA 

SEQIDN0595 

AGANCCTATGATT^ACANGATNGGAGGACTCATGGCTNAGGCTTGGCTGGAAACATCGGNGCT 
GGGGCCCACCACTCTGAACCATATCGTNTAGGACGGTCTCCTACTAGCTGGCCTCCACCTCT 
AGCGGTCTGGCCTCCACATATAACGACCTGACCTATACCTNTAGCGGGATGACCCCTACCTC 
GNNCTCCCAGACCCCTACCTNTAAATGGCTTGGCAGGCAATGCCAGAATCATGGCACGGTGA 
AC T N T GN T AC T G C G AC T G AA C AC C C A 

SEQIDN0596 

TCNCAGCATCGCAAGTGATTTACTTTGNCTGGNGCCNCCAAGNTGGAAGGANGTTAGCCCTG 
TAATCAAGGCGNTNNTGNCCTTGCCTC 

SEQIDN0597 

CNCANCAACCNTTGTANATATGCNCTTTTTACGCTCGAAATTTTTTAGCTGATTGAAGAGGG 
TNNTCTCCNTCTTGGCAGGTATAAGGGGAAAGAAGCTGCTTATTGTAGCAGCAAGTTAGNGA 
TC 

SEQIDN0598 

GAGACCGTTGGCCGCATAAACAGCTCCANCTGAAAAGGGGAGTAATTGTTTTTTTTCTTCTT 
CTTCTGAAATATATATAGACAAAAGAAAGAAAAATAGGAATGAGAAAAGGGGGAAAAGCATG 
TGTTCCTAGCTATTAGTTTCC 

SEQIDN0599 

CTTGCCGGTCCATTTGCAGGTTGAAGTGGCAGCTTCTGGATCATGAAACGATTGAGTGCAGC 
GTCTGCCAGCATCCATTCCTTG 

SEQIDNO600 

CNCCACGTCTNTGTGCCGlNlAGCCNCCCCCTCGCNCCAATNCGGGTGTCATTNCANCGNCANC 
GATTTTTACCTACAAGATAGGTGGNTCGGATCGANNCGCNACATTNGATCAGATTTGNCGGT 

GC 
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SEQIDN0601 

TNCNGNCCCNNTTTTGCNCAAANCCTTGAANCTCCAACCACTACCACCCCAAAATACCNACA 
TNNNTNGATTNAGCTCTTCAAGACCTAGCTATTGNTGNCAATTCTACCCCAAAAATCCGGCG 

ACCAAAATCTGGC 
SEQIDNO602 

TCNNNAACACACCCTAACCTTCAACNCCC 
SEQIDNO603 

NCAACAGCTCTAACTGAAAAGGGGAGTAATTGTTTTTTTTCTTCTTCTTCTGAAATATATAT 
AGANAAAAGAAAGAAAAATAGGAATGAGAAAAGGGGGAAAAGCATGTGTTCCTAGCTATTAG 

TTTCC 

SEQIDNO604 

NAGCCCGGAGCTTTTNAATTCTTTCATAACCCAAGGAGAAGAATAGGACTCTTTACCAGTAT 
CATAACCTCTCNATGGGAAATGGAACTTAGATCACGATGTGAACCTACTTATGAGTGGAATT 
TCGTTGACAAGCAAATTCCCCGGGAAAACAAACTTTTCTCCAATTGAGATGCTCTCTTCATT 
TATGGATTCTATGCGAGATTCGGTTAGTGCGAAGTGTGATCCTGGCTNAGAAGGAAAAGCTA 

TATGC 

SEQIDNO605 

GGNCCAAAATCGGNANCATCTCC 
SEQIDNO606 

TNCCCAGCATTCCGCGCTACCANAGAAAAAGATGGATCCACCANAGATNAAACAAGTTATAT 
TGGGTATAGGCATATGACGAANACCAGAGAGACAAGGGCAGTTCTATGAT 

SEQIDNO607 

GACCCTTCCCNCACCGTNTCGNATCTTCGNTTGAAGANTCGAGCNGGACCCCAACCTATGTC 
ANNCCCCCCCAAATCCATACCAGGNATCCANCTGNCCTCCCTTGNGACCAAACCAAGCTTGG 
CTTTGNCCGAATNTAACCAGAAAANCCCANGNCCNAANTCAGGTCCAAGAACCCTAGAAATC 

CGGAATCTGAGGGTTTTGTNNGA 
SEQIDNO608 

TGCTGTTTTCAGGTCTGCTGATTNTGTGACGACGTTAGAAATCTAGTCTCAATCCCACTGTA 
TGTAGTGTAGAGTAAACAGTTTTGTTGGGCAGCTCAAGAGCTGCTGCAGGTATTTGATGTTA 
GTTCCACGGGCTCTCCAAAATCTTGAAGGCCAGATTTGAAGAAATATCCTAAAAATATGTCT 

TCTTATTCG 
SEQIDNO609 

GNAGNGGCGNNCAGCATCNNTNGATCTGAAAGGGAACATGATTGTNTGGTNAAACTCGTAAC 
GGTAATTAATNACCTTGNTANGTCC 

SEQIDNO610 

GGGGTCGGGTTTCCGGCGAGTCAAGGNGTAATCTTGTTGTTCTGACAACGAGTCGATGTNGA 
AAGTCACGTAACTCAATACCAAAGGAAAGGGCN 

SEQIDN0611 
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CCCCGGACGTTTGAATCTGGGCCAGGTCCTTCTGGTACCAAATANGGCTAGATTTATTTACT 
TTTTCCTAAAATGTAATAAGGCCAAATGCCAGNACTGACTTATTTTTTATCATCTTAGTGTC 
GTTTGGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCA 

SEQIDN0612 

CCCCNCAGCTTNGAACATAACCCCCCGAGCATGACTGCTTNTGATTTACTTANCTTATGCAG 
TTTTNNANACGTTCCCACAAGAACACGTTCNTCGTTGNCAAACAGAGATTNGAAGGTTTGTC 
ATGATTCTGTTACTGNAGATGAGAATGCTCATGAGGGCGGGCTCCCTAAGGAAACTGAAGTG 
CATTCCCAAGACATCTCTGTGGATGCGAAAAGCCTCAATTCTGAGAAATTGAAAGCGCCATC 
CATGGAGGAAGAATCATGTCTTACTTATGCCA 



SEQIDN0613 

NGCCTACNGGCACNTCGGCTTNNTACTTNTGTGGATGGCTCCNNGCTAGCCAGTNTNAGANA 
NTAACNGNTGCATCCGNGACNTATNNATGAATTNCCATTGTTGTCNGATGGTNGGTCAGGGC 
ATAACCTGTTANGNTGGANANCATGATGTGCTGTGGATACACAAAGAATGNAGGCAGACATT 
CACAGAGTGCTTTCTCCAATAGCACAAGAAAAGGAACCATCGGTTTNTACACCCAGAGNGGN 
AACCCCNATTGTTTCCAANCNAAGCAGTAAATTCATGGGAAGNCCTTCTTCACAAGCAGGNT 
CATGGAGGCCC7\AGCATCCAACAGTTGTTGCAATAAAGAAGCAAATTGTGTGGAGTCCTCTG 
AAGATGAAGGCCATGAG7U\GTAGGCAATAGGAAGCCCCTCTCTTC 

SEQIDN0614 

ACAACGGCTAGGTTCCGCGAGTCANCCTGGNAAAGGAGCCTGGNNANNGTANAGANGACCGA 
CAGTNNCGNATACAGNCNCGAGAACGTNA 

SEQIDN0615 

CCAATAGGCTCAAAACGCAACAAAAACCAAAAGAAGAACGAAATTCCCTTGNTTGGATTCAT 
^lATCTCAATTGTCTTGTTTTGTCTGGTACGTGAAAATGTTGATA 

SEQIDN0616 

GGAAGTAGCTGCCTNCTTGTGNTGAAGGCTTGCNGCTGTCTNCCTTCATTTGTTAGCCTAGT 
AAANNTGGCNTATATNTNCGATGGCCGCTCTCATGTGNTAAGCACNTTTGCTNAACCATTTC 
TATGATAGCATGAGAATGATGATGCTATGAGTTACAATGCTGGGA 

SEQIDN0617 

NGGAGGGGTCCGGNAGATGAATCTGGGAAAGGTCCTTCTGGTACCAAATAGGCTAGATTTAT 
TTACTTTTTCCTAAAATGTAATAAGGCCAAATGCCAGTACTGACTTATTTTTTATCATCTTA 
GTGTCGTTTGGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCA 

SEQIDN0618 

CAAGGTTTTGGTCTTTCTTTTTTGGAGATTGGTTGTGCTATCTTAGCTCCA 
SEQIDN0619 

TACCCACNCCACCTCCCGCTGCTGNTCCTTTGNCTTCANCTCATTCNAAGCNTGACNNCACT 
NCCAATGTGTAAAGCTNAGNGGCGTACTCGCT 

SEQIDNO620 
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TNGTTGCTTCTTCTCCACGCCTTCTCCGGCACTACTTCTTCTTNTCCGGTCGAAAATTCGGC 
AGATCCCTCTCATTTTCTGGCTGGGCCGTTCATCTTNCTCAGCACACCACAAACAATCGATC 
TTCTCGACCTCTCAACCATAAAGCCACCATCGAATCCCTCTCATCCGTTTACTCGAACATAC 
TAGTTACAGAACTAAATAAACTTTCAAANTTTTTGCTGTT 

SEQIDN0621 

GCACTAC7VAGAAGCTGCTGNGGCTTCTTGNAGGGTTTTGTGNGANNATACCACTCTTGATNN 
NTGTTNNCNCCGATGGTTATNGGTTTCANNGGAAGCNTCTTCAAGTCTTACAAATCTTATGA 
GGNAGCAANAGNAGTATTCAATGACTTCCAAAAAGAAAATATTTACAGTGAAGAACAATCTT 
CAAGTTTGTGTATTGATGAAAGTGATATTGGAGCAAGTGTTATGTCATCTGTATTGTTAGCT 
GGAATGTTTGTAGGGATGAAGATTTCAAAAAGTTCTTCAGTTTGATTGTTGATAAGAGTATT 
TTGCTCAATTTTTTATTATNGCTTAGNTTGGGTATTATTAGNTNGATTGTNNAGTTTGANGN 
NATACTGGNTGNCGCATTCAACCTCTGTNGAATNGAGTATTTAGGATGCCNAAGCCNTTATC 
TTTTTGACTCCCNGTTGGNATGNAATAAAANATGTCTGNTGATT 

SEQIDN0622 

ANTACANNTAAAGGTNTTAGCTGCTGACATTTNGAATTGTCGCTCAAGCTGNTGNTTGGATT 
GCTTGTCNCTGAAATTTGNATTTTTGAGTGTTCGAGTNCGATNNCAATTTCAGAAAGTGAAG 

CTACATTNTGTTGAATCTNCTATTG 
SEQIDN0623 

CTAGGCGTGTTAGTCGACAAAGCATAGCCCACGTTCTGTGTTTTTGGATCGCAGTTCATCGT 
CAAATTCTAGGCGTAGTTCTAGTGGTACTAGTTCGAAGCATCCGTACAGTAGCT 

SEQIDN0624 

AGAACCAATCCCCAAATTTTTGGGGTACCCACTCCACCTCCCGCTGCTGCTCCTTTGCCTTC 
AACTCATTTCAAGCATGACAACACTTCCAATGTGTAAAGCTTCGAGGCATACTCGCT 

SEQIDN0625 

CAATTAGCNTGTGCNAGNCANAANAGGGAAGAGAAGNAATNTTTGTATAGCTTCTTGACAAA 
TGTAGGTNTTAGTGATCCTTGNTATTTACTTAT 

SEQIDN0626 

TTGTAATGCTTTGTTATCCACCACTGGTGTCGAACAATGTTCAGTGTTTTCTTCTAATGGTT 
AGTTCAAGTTGTTGTGGATAAATGATTATACTGTGCTCTTCGTAAACATAGGATGCATTTGT 

ACCAT 
SEQIDN0627 

TGGNTGATATCATTATAGATATAGGGCTTCACTCCCTAATCNNTNTTTTTCCAAGGTNTACA 
CAANCCTGATTNTTCNNCT 

SEQIDN0628 

AGGTTGATGAAGAAAATGAAAGACTAATAGTTGATGAAGTATGTGAAGCAATGAACAAGATC 
AATGTTTACAACCGATCATGAGTTTGAAGGAGTAGAAGAAGAGTGTGCTGAATTTGCATTTG 
CCTAAAGGAAACCACTTGCGTTTCCCCGAAGATAACTGAATGAAAAACTTTGTTTTTTTTCC 
GCTTTCTGTGAAGACACCAATAGCTGAGGTGTTTTAGAAAGTATTACATTCTG 

SEQIDN0629 

NTTCNANCGAACNNTCCATGTGCTCATTNCATGCAATGCTGATGNNNAANNGTGTCCANNNG 
GCCGTTTACNCNTNGG 
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SEQIDNO630 

TGACCNAGGACCAANATTGAAGGAACATCAACAAANGACTTGTTACTATGAATCTTTNGCTT 
GNCGANTAGAGCTTATNTATTCTTATGATGNTGATGATGANGCTNTAGGCATNAAACTTCAT 
ACTAATATCTTTGNAATTGCATCTGGATGTTCAACTTCTAAGAGTTGTGATGGNCTTTAGAN 

TTTGAG 
SEQIDN0631 

TTTCTTCAAGANTGCCAAANNAAGCATGC2\ATGAGCAACGGTTGTCACACGACATATAGCAC 
TGTCAAGTTACTNACAAAAAGTGAGAAAAAGAAAAATGAGAGAGTCTTACTAGTGAAAACCT 
CCACGGGCACTGTAAGGCGACGGTAAGCAGAGATGAATAAATGAGAGAGACTTGTTGGTGAA 
AACCCCTTGGGAACTACTTGTCGAAAGTGAGTCGTGAAGCTGATGCGAAGAATTGGCATAAA 
CAAGCCTGACTTCAAAGGTCATAAGAATGGTATAAGGGGAAGATTGGATTAGTTTGGTAGAT 

CGGTCG 
SEQIDN0632 

GTGCAGGAGNTGGCCCAAAAGNANGGGAGNTGAATTTACTAATTCTGNTGNTGGC 
SEQIDN0633 

CCACNCCCCCTATTTTCCCCTATANGCCCNTTCTACATTGGCACNTTTCACAAACAAGNACG 
CTNACCCTTTNTTATGTNGGACTCTGTACNC 

SEQIDN0634 

GAGGCNTTNCATTTGANCTTCATTGNACCAACAACTTNACCACCATGGCACACTAGTTCCTT 
GNCGACGGGAAGCACCATGAAAAACGCTGTCCCTCACCACTAAAAGCTCACCGGAAAATAGT 
NGCCGGATAAGCTTCAGCACACCCAGGACCCTTCTCGCATCTCCTTCACACCAGCGACCCCT 
CCCCCCCGGNCG 

SEQIDN0635 

NNNACGNTCTCGAGTNTGNNGCCTTTCTCAAGACTGCCCAAlANAAGCATGCNATGNGCAACG 
GTTGTCACACGACATATAGNACTGTCAAGTTACTTACAAAAAGTGAGAAAAGGAANAATGAG 
AGAGTCTTACTAGTGAAAACCTCCACGGGCACTGTAAGGCGACGGTAAGCAGAGATGAATAA 
ATGAGAGAGACTTGTTGGTG7\AAACCCCTTGGGAACTACTTGTCGAAAGTGAGTCGTGAAGC 
TGATGCGAAGAATTGGCATA7\ACAAGCCTGACTTCAAAGGTCAT7\AGAATGGTATAAGGGGA 
AGATTGGATTAGTTTGGTAGATCGGTCG 

SEQIDN0636 

TCCCCAAANTCTGNTTGAATGAGNGNGCCCANACCAGGACNGCTTNGCCGCTAGACCCGGAC 
ANACN T C T T T T C G AN AAACN CAT C G AN C AG G G C A 

SEQIDN0637 

TTTGG7\AATCGCCCAAGACAATTTCTGGNATCGGGGAAGTTTGNAGAATNNATGCTATTGGC 
ATAANTCAGNAGTTTNNAGATNCGAANCTGCCANTAGACTCGCTAAAGCTGGCGCCTNACNT 
A 

SEQIDN0638 

TGCCCTAAAGCCGGGGAAAATCTNATTGGNGGCTG7y\AATGAACCAAAAAAGCTGAAGACAA 
AAGG7\ATGATCAAAGAAAAGGTTCGTAAATTATATTGATACANCTCTAGACAGTCTCCA 

SEQIDN0639 
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TCAAAAGGCAAGCAACCCCTTTGGTGGGCATAAGGGTATAAATGCCG 
SEQIDNO64 0 

NCTCTACACAGAAACTCGAAACCTACGCNTGACGGTCACGATTTCAGTANCCTTCCNNCTCC 
TGNGT 

SEQIDN0641 

TGNGGTGGGGGAGCTCGTCACCTGTCTATCAGGACCTTGNGTATACTGCCCAACCTGAAGCT 
ATGCAAATGTCACGACNCCTTAGTCT 

SEQIDN0642 

GATCCCTCNCTCAAATGCATTCTGATCAACTAAATTTGAAAGGCGAGGGCAATCGATGTTAT 
AGAAAGGGGTTTCGTTGGTGAATTTTCTTTGTTCATTTTGCGCAACAGCTTGTTGTCTTGAT 

AGTGAAGGAGTTTATTTTGTTTACAGAATATTAGTC 
SEQIDN064 3 

CCTACATCACCAAAGCTATCATCTATGAGCTGGTGGAAGGATGGAAGCACTCCATGTTTCAC 
ACTGATTGAATCACCCGTCCTACCAAAGCATTGATGTCTTCTTCTTTATGATCACAGGCACC 

CTATTAC 
SEQIDN064 4 

GAAGGCCNTTNCGTTNNACACCAATGAGCCCTTTTCTTCTAAAAAACAAAAACACATTCAAA 
AACCATCCTTAGCAGCAGCAAAAGACCTCTAAAAATAAGTTCAAACCAGCTTTTTCTTTCTC 

CCTAAATAGTATGAAACCCGTCCAAATAAGC 
SEQIDN0645 

AATCTTTTCACCATCGGCCGCAATAATCGCCTCTGCGGCACGTTCAATCTGGNGTGGGCTCA 
AGAACAACAAGTATTTGGTCTGCGGATACTGCGCTGCTGCTAGTTCTTCTTCGGTGGTTTCC 

TCAGAATCTTTCAGCGCCTTGATGCGCTTTGCATCCACTTCCGCC 
SEQIDN064 6 

GGCTCATATCGATTATGGATCAGANATTACCGGAGAAGAAAGATTTTTACCTTTTTAGACTT 
ATACTAGGGATGAAACTCTNCTACTATATAAAGAGAAAGGTTTTCTTTTGNAACATATACTG 

GAACATGCAAATCAAAGCAATAGGAGTTTATTTTCTGCC 
SEQIDN0647 

GCNNTGGCNNATCCCACTNTATGGGCGGTAGCCAGGCGTATACCGAGGTCGGACAGATCACT 
TAGCGCTGNCGGGGGAAAAGGGCTTTGCATAACCCTNGCAGGACTCGTTTGNTTTACNCGCN 
TGNAGTNAGGACCTTTGNTGCGAGGNAGCCCGTAAAGCCGAGCAGCAAAGNCATATTCCTGA 

GCTGGTNAAATATTTCNGNCNGACNGGCCACGTNCC 
SEQIDN064 8 

CCNGGAACCTATTGACTCGACCTCAATCAAAGAAAAGGGATGGTGATTTCGCTCCATTTCCA 
GGCTGNTTCCTGGTGTTCAAAGGGTACTTTTGAGTGGCGTTTCAGGNGGNCTTTTTTAGCAA 
CGACACAACTATTTCGAACAGAGGTTTCAGCTGCGNTTCGAACAGTTTTGAGAGNGATTTCT 
GGNGGTTTNCGGGGCTAGAAGGATGCTGGTAGAGTTCTTGTCCGAGGTTTTGACATTTCAGA 

TTCATCGAGGTCTATTTCTTCCTTCCTCACGTTGTTTGTGC 
SEQIDN064 9 
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CNCATCACCAAAGCTATCATCTATGAGCTGGNGGAAGGATGGAAGCACTCCATGTTTCACAC 
TGATTGAATCACCCGTCCTACCAAAGCATTGATGTCTTCTTCTTTATGATCACAGGCACCCT 

ATTAC 

SEQIDNO650 

TCGTCTACGGANGATTGNTCAGGTACACGCTTCTGAAATTATGGATTGATGTACGTTTGAAT 
TGGAAGTTGAGTTGAAGTAAACAAAGNAAATGAATCGTTCACCTACTTTCACAATACCTGTG 
TTTCAAATGTAGCAATAGGA 

SEQIDN0651; 

CTACGGNNAACTCCTCATCTTNNCCCTTCTACTCCTTTGATGTCCAGAGCAACATTTTCCGG 
TGCCGGAATTGTGAAAGGGAGGTCAGCGCGAGCAGAATCACCAGCCATTGTGGCAATTTGGC 
ATAGTAAAAAGACAATGGAAAGGAAGGATGAAAGTTTTCGA 



SEQIDN0652 

CGAATGTCCTGATTGCACTGAAATGAAATGAAGAGGAAGCATATTTTTGTTGAAATTTCCGG 
TGGCTTCAATGCTNTCATTATAGNTTTGNAATAATTTTGGACTGNATTGAACTGATGAACTG 
TTAGGCTTGAGTTTGATCATTTGGACTA 

SEQIDN0653 

CTGGTGTCGAACAATGTTCAGNGTTTTCTTCTAATGGTTAGTTCAAGTTGTTGTGGATAAAT 
GATTATACTGTGCTNTTCNTAAACATAGGATGCATTTGTACCAT 

SEQIDN0654 

TTCTNCGGCAGAAGTCAAGCTATCTATCAAGTGCACTTGACCATGATAAGGCGACAATCCCG 
GAGGGTAACTCTAGAGGAGGTACATGCTCGCGGCTTTGATCTCTCAGCCGATATTGAAAGGA 
CGAAGATTTTGGAAGAAGAGGCTGCCACTCAGCTTTCTGATGAGGATGATTCAGCCAGTGGC 
TCTAAGAGTGGAGGAGACGAAGATGAAGTCCCCGAGGGTGAGGCTCTCGAAGATGCGGCTCC 
TAAAGATGAAACTGCTGAAAATATGACCCCGAAGTAGTTTTGGGTTTCCTTATTTTGTTTCT 
GTTCAAGTCTCCCTTATGTAAATATCTCCTA 

SEQIDN0655 

GNNTGCTCTTGATTTTTCTGAAAAATCAGAAGAATCATCAGTGTGTTCCTCTGTGGTGTCAT 
AC C AAG G AG G T G AG G C T G AAAG T AAAG AG AAT G A C G A C AAT T C AT C T AT A T G G T C AAT T CAA 
GTGAATGCAAGTACTAAAGATGATGAAGAAGATGAGGAAGAAGGAGGACTTGAAGAAGAAGA 
AGAAGAATATGATGATGATAACTATGATGT^AAATGAAGAAGATGGAGATTTAGTTGATGAAC 

TGTGTGAAGCAATTAGCAAGA 
SEQIDN0656 

ATGAGGTGTTGGGTTACATCTCTATTTCCCTTTTTGTACCNTCCACGTGGACACTTCTTCTC 
CTTTAGTTTTGATTCTTTGTCTGCAATGCCCCTCTTTCCAACCTCTCAAATGCCTGGACAAC 
AGATAATCTCGTTCTTGTTTGNTGCGACAAATGTTGTTCATAAGTTGTGTTTATTAT7\AGAT 
ATTGAACATCATAGCTTCCACTTAGTTCTTTAGCTAATGTGAAAGTTGCTTATGG 

SEQIDN0657 

ACGTTGAGAGCCGTAAGCCAGAAACTGGAGAGGAAGATACAAATGCATCTGCCGGTTCAACT 
GGAGTTGATAGCATGGCTGATAGCATAAAATCATTCACTTGTAATCAGAATTTTACAGATAC 
TGAGGCTTGCACGTCAGCAATAGGTCTATCAGCTCATGATGATCAGGCATCAGATATTGCAG 
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ACCCTGAAGAAGCTGCTGTGACAGAATCAGCTGTAGTAAGTCAGGAATGTGCCTCTAATTTG 
G 

SEQIDN0658 

GGATGAGAGAAAGCCAAGTCGGACGGTTTGGTGAANCCAGAACTAATTCAGCAGATCGTTAT 
AGTGGACAGAGAAGCTGATTTTGAAAATGCTCTTCAGAATGGTGGGGGGAAGATAGCTCCTG 
GTGGTGTAATCAGTGTAAAATCCAACAAATTCAAGCTTGAGAAGCATTTNGAGCCGGNGACT 
GAAAAGAG TGGCN AG AAAAAGAAACAAAAAACC AT TCT GGA 

SEQIDN0659 

AGCCTGNCCTAAACCAGTNTTCGATCTNTGCTCTGCTGCCATTTGTNGAACCATTGGCACAG 
TGGAACTGAAAAGAAGAACGCGTCCATGCTGTCCTTGTCCAATCACTGTCCA 



SEQIDNO660 

AAAGCAACTGTTTNTTAGAGTNCATGGGTTTAGCCATGGCCCATNCTTNATTAGNCCNAAAC 
ACTCCCNAAGATATNGATATTGGNCACAACAAAGGCCCGTGCAGAAGATGGTGTGCCACTCC 

CACCA 

SEQIDN0661 

ACGGGGNNNTTGTCCCATTGACGTATCTCACAACTATTTTAANNGNCAAACCCGAAGTGGTA 
TGTGGTGTGGTCTGCAAATATGAACNCTCACATTCTTCCCGNGGTGCGTAGTTAGCTACAAA 
TATGGACGTCATATGTCAGGTCAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATGC 
TTCAAATGCAATGGGNACA 

SEQIDN0662 

TTGAGAAAGTTTTGTTTTTAAGACNGGTTGCTNGGAAAGNATGGNNGTTGGCCA 
SEQIDN0663 

TTNNAATAGCCATACAAGGTATATCGGNGGTTANTGCATGTTTTTNAACTTATGGNNCACNC 
ANNATTGTTGTTGATCCANGGTCACAAANAGNCAAGCNGTCANGNTGNANGAGANAANTNAA 
NAATGGAGGCANATGTGGNGATGTANNTACCAGTTGTGAACAATANGACATGNACT'GTTCGN 
CATGATTGGCACNATTTGTGNGGNGAATCCNAAGCAA 

SEQIDN0664 

GANGACCCTATGCTGATGATCCCTATGCGTTTGGCTAGAGGTGAAGATGTCCCACTCCAGTG 
CAGAGCTTCCTAGAGAATCTGAAACTTTGACCTGGAAATGTGTGTGCGCTGATTCTTTGATT 
GCAGACGTATAGCTGGCTGCTTTCCACATTGCAAGGAACTAGAATTTTACTTCCCCCAAAAA 
TAAAACTGTATATAACTGCAA 

SEQIDN0665 

CCCTATGCGNTTGGCTAGAGGTGAAGAATGTCCCACTCCANGGCAAAGCTNNCTAGAGAATC 
TGAAACTTTGACCTGGAAATGTGTGTGCGCTCNACTTTGATTGCNNTACGTATAGCTGGCTG 
CTTTCCACATNGNNAGGAACTAGAATTTTACTTCCCCCAAAAATAAAAC.TGNATATAACTGN 
NATTACTCAGGACTCATNATCCTCCTGCTCAAGTTGCTCAAGTTCCTGGAGCAGAAGTGATC 
CCTGCTCCAGCTCCTACTGGCTGGGAATGAGACCTGCTTCCTTTAGAAAGTTCTTTTTGA 

SEQIDN0666 
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GANNGNCGTANACGAAGNCAGGGGACTGAATCATNAAGTATGCACAACGGAGCTCTATTTGT 
TNGTTCCACCNTGTGTTGGGNGGGNGGAGTGGCTNCCTANTGATATGTATGTATNNTNNGAG 
CCAAAGNTCATATTATACTTAANCCTACTGNGCNCCTATAAAGAGAATGCCGCGAGATTCAG 
AAGATGCTTCTGATCTGTGA 

SEQIDN0667 

GGAGGCTTVATAAGTTGAAGGCATTGCAGAGAGCTGCTGCTCGAACCTCTCATATCAAGTCTA 
CGTGATGGTTTTCACATAGAGCTCCATAGAGGTTTCTAACTAATTATATCCTTTCTTATTGT 
AAATGCTTCAGATTACCTTCAATCTTGAACGTCCAGAGACTTGTCCAAATGATAAATCTTTT 
TACTCTTTCACCCAAATTGGATGTCATTTTCA 

SEQIDN0668 

AATCTGAAGGGTCAGAAGAATCATCAGTGTGTTCCTCTGTGGTGACATACCAAGGAGGTGAG 
GCTGAAAGTAAAGAGAATGACGACAATTCATCTATGTGGTCGATTCAAGTGAATGCAAGTAC 
TAAAGATGATGAAGAAGATGAGGAAGAAGGAGGACTTGAAGAAGAAGAAGAAGAATATGGAG 
ATTTAGTTGATGAACTGTGTGAAGCAATTAGCAAGA 

SEQIDN0669 

GCCANCCCAGTCGACAAGACCAGCGCCTGNACGTAAAAATCTGATACCTGACTAAGCTTATG 
TCCTGAGGGAGCCAACCTCCCTCAGGCGTCTGTTACTACCTGCTGGCTT 

SEQIDNO670 

GCCGGCTCTGNGTCCACCTGACTATCAGAAGCGGCNCAGATGATTGCATCTGTATTANAAAC 
AANGGAATCTCCATCTTCCATGANTGNGCCTATAGACATCTCTCTATAANTCATTTTTTTTN 
CTTNNNCANAAATNGNCGGAGATACTNTAGCTTCATNANTNGT 

SEQIDN0671 

GGGCAAGTGGATGGTGGGTACTGNCNCGTTCGGAGCTCGAAGGTTTCTGNNNCTGGATTGNC 
TGTCTATACCATTATGTGATGTNACCNAGATGGCATCGCATCTTGAGGCCCACTCTCATCTN 
GCTTNTG 

SEQIDN0672 

GGNGCNATTGCCNAANTGTGCTTCTTGCTGGATATCATGTGTGAGTGTTATCTTCAAGAACC 
TCACAAATTTGTAGTTGATCAGAATCTTTGCAATGCGTTTTCTCATTTTCTTTCATTTGTGC 
TTCCTTTATTTTGTCTTTTACG 

SEQIDN0673 

GGTGCTGAATTGGAGGAAGGAGAANAGGANNNGGANGAGGAATGCCTAGNNGNNNGNGTGCA 
TAGANTCCAACTGAGTCACGCAAGAAACCAGTNTGTTCCACTGNTTGGCTTNCTGCTAGGGN 
TGTTGAGTCTTTGAATAAGAACGTTGATGGN 

SEQIDN0674 

CACCATTCTTGATCGTAGTCCGAGATTCCACGGTGAGCTGCTCCCTTCCTATGTCGTTCAGC 
AGCATGATGGAGTCTCTCTTTGCTTTTGGTTGTCTATTCTATTTCAGACAGTTGGATAGATT 
TATTCTTTTATATATTCTGCTAGATGCCCATATACTTGTGACACCAGGTCTTGACACACACA 
TTAGTAGACTATTCTTTTGGGATTGTATAATTATTATTGTACGTTGCTAATTATCACTTGGT 

SEQIDN0675 

GGGGGNNGNTTNTCTCTCCGCTGGAAANNTGANTGACTTGGGTGCTAANTGATGGNAGACCN 
ACACACCCAANAAGGGNAAGNGGAAAGGACGACATGGNTCAATAGCNCAGNGAGGGAGACAG 
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ACGGAATGAAACGANNCAAGANANTGGGGNNACCNTGTTCTATTTANTGTGNNAGNNNAAAC 
AACCCACGTTCCTNACAAAACAAACAGTATTTTGGATCGGAGACTAATCTGAATTTTCCAGA 
CGAGTTTTTTNCGGTNAATCTNGAGGTTCCGACATGGNTTTTTG 

SEQIDN0676 

TAGGGAANCNATNCTCATTTGTTATGACCACCATTTACTTAGCT 
SEQIDN0677 

CTCNGNTANCAACACGGCTGGATAAACTTCAGNGCTCCCGGTGTGGGTCTATTTATCGGAGT 
TTGAGCACGACNNACACCCCGGGACCATNTAGNTAGGATNGCTCATTCANGAATAGC 



SEQIDN0678 

TCAGAATGCGAATTTGCCTACTCAAATGAACGAGATTCCTGCTAAGTGGAATGGCAATCCGG 
AAGGTTGTAGTTTTGTTCGTCCAAGCTCTTTCTCGGCTTCCTCATCACCTGCAGGTCCTTTT 
AGATCATCATCTTTGTATTATTCTGCCGGCTTTTCATAACCAAGAATGTTGCCTTGCATGGG 
CATTTACTCTCATGACAGACAATAGAAACCTGACGCTTACAAAGCATAAATATAGCAGTCTG 
AACGAAAACACACACGGCAAGTTTGAGCAGATGAGTTATTCTAGATTTGCAGGTTTTGCT 

SEQIDN067 9 

TTTGGCCATACAAAGGGNTGAATATGAGGNATATGGGGGGNTAGGCATATGTCGCACAAACC 
CTGGNAT 

SEQIDNO680 

ACCCTACCGGGAGGATCATATGAGCGTGGGTTCTACTGGCCTCGACGTCCTCTGTAGTTGGA 
AGGGAAACCAT 

SEQIDN0681 
GGTGTTTTAGGTTGTCT 

SEQIDN0682 

TNGCCCCNGCCAGTCGGACAGAANCGGNTAGNACCGAAGNCNATNCTGCCACGGGCANGGAA 
GACGT 

SEQIDN0683 . 

AGAGGTGGTGGGACTGTTCGTTCGGTGCTCGAGGTTTCTGGTTCTGATTTCTGTCTATACCA 
TTATTGTTGTAACCGAGATGGCATCGCATCTTGAGGTCCACTCTCATCTTGCTTATG 

SEQIDN0684 

GCNGNGNNCAAGGNGGCTACCTGACNTNACTNAATAAATCAANCTNTTTGAACTCAGGGTNT 
ATAGGANGAGATGGAGGCTCATGCATGGTTGACACCAGGGTTACTGGAAAGANGGTTTATCA 
TCCAAACCATAACATTGACACTGAGGATGATGCACTTGCGCTGAAGTTGTCATCAACCACAA 
CCATTGCTTCAGATAATACGAGCTCATTATCTAATGAGGAATCAGCAAACTTAGCAAGTGTT 

ACTTCACTTTCTG 
SEQIDN0685 
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CGTTACATATTAGGAGTATAATTTTTTCATTACTAAAGCATGTAAATATGTTGCTCCGGGCT 
TTGGTCTATTAGTAAGAGCGCAATGCGTGATATGTGGG 

TCNAGCAATTANNNNNTTTGGCCTGCNGGTNCCTNTGGCGCTGANGATCTCTATGCCCCGCC 
GGCAGACGGTGGATTGGATGATGACAATGCTCACG 

4 

CCATACANTCAACCTGGCTGGNCCATCACTTAGGGCNNNGTTTCAAGATTATCCAACTTGGG 
NAATACTTATCCGCCANGATCNATAGCCGGATCAGACNGACG 



TCACTATGGAGSACAGTTTGATGACGATTACCCGAAGCAGCAGTAGGTAGCAAATGGGAAGT 
TATGGGCTACTGATAGTAGTGGTTACTCTGG 

CCGNGCTTNGGCGGGAGTAGCCTGTGCTCATTGACNGGAACCCGTNTCGCAGGACCTTCGCC 
rrrTOAGCCGTCGAG^CCTGicGTTTGATGAAIACCAACAGAAGAAAGCCGAACTGACCGGC 

tcgggtttttgntttgccatcccgtttccggagcctg 

ccSnanSnacantgctggnncatnnacaacccggtgcgcggttcgccgttgcgcggcagt 

TCCGGC 

Scngg^aggtac^ 
aaac aac aac tt c c acc 

SEQIDN0694 
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TTTCTTGTTGCTCGTGAAGAGCCAATAACCAGCCCCCGCAGCTATCAACATAATCAAAAAAG 
AACCACCATAGATCAACCTGGTAGGCCCATCACTTAGGGCAACTTTCAGATTATCCAACTTC 

GAAATCTTATCCGCCATAATCAATAGCCGCTCAGACTGACG 
SEQIDN0695 

TGGTCGNTGNAANAATTTTGCTGGAAGCTTTGTNNAATGAAAAATTGNTGCTTCAG 
SEQIDN0696 

GCCAGCTAAGTGGCTTTATAACACCAAAAGAAAGAGGCCTTAGGACAACTAAATATGACATA 
CACTTAGACAACATGAATTTGCCAATTTATCTGTTACTATTTCCATTGACCTCTAAACTCAC 

CTCCATGCA 
SEQIDN0697 

NCGTAGCTATCTTTGCTGCTTCTTTGATGCTTTGAATCATCTTTGATCTGTGACGATATTTT 
GTGTTTTATTTCGCCGGAGTTGAACAGTTAGGAGTTTATTTATGGNTTTATTTTTCACTGTT 

TTTTGTTCATTCTTTTTTTTACTTCTTGACA 
SEQIDN0698 

GATGCATGTGTCACAGAAGAGATGCCATAGTTCCATATTAGGAATTGATAAGATGTGCTAAG 
ATCAATATAGGTCACTTAGTATTATTCTCTTCTAGGCACTAGTTTCAGGTCATATTTTAGTT 
TTATGGGATGCATTTCGTAAACTTGTTCTTGCCTTTCAGTTTCATTTTGTATGTATATGTCA 

CTGGTCCATATTGTTGTTGACACTCGGCA 
SEQIDN0699 

GATGGACGTGTTATTGGTGGTGGAGTTGCCGGNCTATTGGTAGGCTGNCAGTCCTGTGCAGA 
TTGTTGNGGGCAGCTTNCTTGATGGAATTCAGCTCGAGCAGACGACCAAGANAAACAAGTNC 
GAGCCCATAGNTGNAGCTGNTCCTCTATCTAGTACAGATATGGAAANCGCCTATNACTCATN 
ATNAGCANAACCAACTGTAGNATCGGATTCTTCCTTACATGAAGATAACTGNNCATNATTAG 
CCCNCGACTTGAGGAATANNCCTGCTGACATCAATGNATNTAANCCTGCATAGGTTTTTGTN 

GAAN T GN ANTTNATCN G 
SEQIDNO700 

NCTGAAGAAGGTCCTNTCGGGANGAAATAGCTAGGNNGTCTTNGNTTCANCT 
SEQIDNO701 

NCTGATTGTTCTTACAAATAGGTCAATCTTAGTCCAAGTAAGTATATTCTCTTACTTCTGTA 
NTTTTCCAGATTTGGT 

SEQIDNO702 

TAAGGGATCACGACCCTACGGGAGATCATATGAGCGTGGGTTCTACTGNCTCGACGGGCTNT 
GNAGNNGGANGGAAACCAT 

SEQIDNO703 

GAANGAAGGTTCCAAGNGNCTCCCATTGTGGAGCANTATCACCTACACATTGTAGGGCTAAT 
TATCTTTTCACTTCACNCGGTAGAGGANCAGATTGCATAGCTGT ' 

SEQIDNO704 

NNTGATGTCCCTTCCTTTCTGGTGTCGTATCCGGCTTTTTNCGTGGAAGCGGTGTTGCTAAA 
TCGNGTGTCCGACGGCCCTTTACTGTACTGGAACGACATTTCTCATTTTGTTGCTGCTGTTT 

ACGGT 
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SEQIDNO705 

GATATCTTCATCTTTGCGCTTTATGTTCTTCACATCCACAAGTATTGGTGTGTTTTCTGCAT 
TATCATTTCTCAGTAGTTTCCTTCTCTGTTTCTCC 

SEQIDNO706 

CGAAGAAGAAGANACTTACG 
SEQIDNO707 

CACGGAAATCACGCCGNCNTTGGTACCTTGACCGGGTTNCCTANAGGGNACTTCAGTCANTG 
GGNNGCCCAGNNACTGAGGNGGCCG 

SEQIDNO708 

CATCTGGATTNAACAATTTCATGGCCAGGTTTTCAAAAAAATAAAACAAGGTCTTCATGGCC 
GTGC 

SEQIDNO709 

NAACCTTACTGTACAAAGGAAATCATTGGTTGCTTGGGATAAAGTCTGCATGCCCAAAACTG 
AGGTGGCC 

SEQIDNO710 

CGGNNTTTTGACAAAGGTTCCCGCTTACACACTCCTCGTNCGATGNGCTCCCTGACCCGAGT 
GTTNTCGCGCAGCAGTGTCATGNTCAAAACCAGGATTGNNTTNAAAANGACAGGACTTCAGG 
TCATTNATTCCGCC 

SEQIDN0711 

ATTNTNNTTTTTGGAATGGTAAATACAGGTTGGATAGAAGCTTTCCCA 
SEQIDN0712 

GCAAATANTTATANGAAAAGGTCAAGGAAACACTAAGTGTGTCATAAATAGGATTATCTATT 
ANTA 

SEQIDN0713 

CTNNCTTTGNTNGACGAGAGTAANANCTTGGCAGCTATCTTCCAAGCCATTTTCAAGGGCTN 
TGCATCTGTAGTNCTNTGCA 

SEQIDN0714 

CTGATAGTCTGATGGGCTTCCCTTTGAGGGTAACCCGACCTTTCTCTCTGGCTGCCC 
SEQIDN0715 

GANCCAGCGGNANTAGCTGCTGTACTANNNACAGGNATCCAANATATGAAAGCT 
SEQIDN0716 

AGCACNTCCGGCTGTATCTTACTACCAGAGAAATTACAGNTGTGGACATATCTCGAAGATGA 
ATCAATNGAATATATCTNCTAATGAAATTGTCTTGCTCTTTNGTTGNGTAT 

SEQIDN0717 

ACTGGTCCAAAAGCTNCAAAAATTTGTCTAAGCTGTACTNTGNCATGNNGAAATGNAGATTT 
CCNACATAAAGTTTTCTCTCTGAGGCAGCATTTGNGCCTGCCAACCCTGANNCACCACCANA 
CGCAGTTGACTGAACAAGGGTTTTTTCAAGCTTCAGAANGCNTTACCAATNNTGGGTNGNCC 
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AAAAANCAAGGCANCGGGTAAANGAATTGGCCATNGGNCCAANCTTNGNNTATAAAANNNNA 
ANGTCCCNAANTCNTTTANATNGCNTNGAATNCCGGCCNNTGA 

SEQIDN0718 

GATTTAGTGATNAATTTCCAGCTTATTTTTTGNTGTGAGAGGAGNGCAGTATCAGNACTCCT 
TCTGGCGCCAGGATACCATNAACAGGTAGCCATCGAAGGTGTACA 

SEQIDN0719 

GGGAAGNTCCAAACAAAAAAGAAAAACGCAGTAATACCCTCCAAAAAGCTTCATCTTTCTCA 
CCAAAGCCTCTTTGCTTTGGCCATAGAAACCAGTAACCATTAGCTATGTAAAACCATTGCAG 

CTACCATTTTAGAAACAGTTTCGAAACGCCA 



SEQIDNO720 

AGCCCTTNGTCAGCCCACCTNTTATGCTCAATCNCACCGNNAGGAANNCTGNNAGAGTTANN 
GANGCGATTGATTNCNGCNCTGACAGATCATATNGCTTCTATAANNGTTGNGCGGACACGCG 
AATNAGNTTNCTTACCCTCGCATAAGACANATNCTGATCTTACCAACCACTCATTAGATGTG 

GN AC C T AC AG C AN C T AC AT C T T C T AC T GC T G C T AAC A 
SEQIDN0721 

AGGCATTNTNCAGNGCGCCACAAGTATACTGGATTTCCCGGAGACATGTGACTGGAGANGCA 
TCACCGCAAGATTTGTCCGCTCAAACTCTCATTGATGCTGCCATTGNCATACAAAAGTGNAT 
TCAGCGGGTGGATAGTAAGGTCTTCTCTTGTAAGCACGGACCAATCTCCTACAGTTCCTAAG 
GAATGTGAAGAGAATATAAATGCAGCGGNAGCAATCCAACATGCTTCAAAGGAATATACA 

SEQIDN0722 

GGAAACATACAACGAATGCCAAAATCTGCCATTTTGA 
SEQIDN0723 

CTTTCCGAATGCTACCNGATNGTATCAATTGGGGTGAACTGGTTGGGGTTTTTTTCCCCCTT 

TACC 

SEQIDN0724 

TCNTTGCNGANATCTACCTTACATGTTCCTGATGCAATCATGAGTTACTCTGATTTACACAT 
GGGTTGCTGNGGGCTGATTCCATGTCC 

SEQIDN0725 

GCCTTAAATGGTGTGTTCTAACAGGCTTATGGGTATGCTGGCATTCTCCATTGCTGGGCATA 
CCCACAGCCTGCCCTTGCCTCTTTCAATTTTCCTATTCCCCC 

SEQIDN0726 

GCGNTACTTCANAGTCNNGGANAGAGGCTAAGAGGNCNNACANNAANTGCTTCAGTACTAAT 
GAANCANATNCTNGNNTCTTTTTNAGGGACATANCAGGTTTTGACAAGCCCCCACATGAATA 

AGAATATATNANACTTCTCTAACC 
SEQIDN0727 



FIGURE 4 (continued) 



135/140 



TTACTGTTTGTCCTACCTGGTGATGGATAGTTTGGGTTCTGAATAATTTGTGGGATGCAACA 
ACAAGCTTTTGGTTACTTTTTGTNAAGTACAGTGGTTACTTGAACTAGTTGTGTAATATATG 

CTATGGTAGTGGTCGTATCTCG7\AACACGTGATATTTAGTGC 
SEQIDN0728 

GCCTTGGTAAGACATTCGTGAAAAAACTCTGTTATTTCTTAGAGATAAGGTGGTTCCCGC 
SEQIDN0729 

GACTCTTGCCAAAATTGTATCTAAATCCTCATCTTCCTTTGGGATTGGCCAAGATTGGCTGG 
CAATGTTGGGGCATTTTTGTTCGAGTTGTTCATGTTGGAC7\AGTGACTACCTGATTTAGATG 
TTGCAGAGCT^AAGCTGTGCGATTGTGTTGTATGTTATTCTTCTA 



SEQIDNO730 

GCCCGACGCTTGAAAGATTGCATTTGGAGA7VATGCCAATTGAGAAATAAGGAGAGCTTGAGA 
ACATTGTTTCTACTCTGTCAAGACGTCAGAGAGGTTATTTTCCAGAACTGTTGGGGACTGGA 
TAATGAAATGTTCAGCCTTGCCAGNGTTCTAAGGAGAGTGAAGTCCCTTTGCCTGGAAAGCT 
GTTCACTACTCACAACTGAAGNCCTTGAGTCTGTCCTCCTTTCATGGAAGGAAATCCAGAGC 
CTCAAGGTGATTTCATGTGGCAATATAAAGGATAGTG7y\ATCAGTCTAGCACTGTCTACCTT 
GTTCTCCGCACTAAAAGATTTACAATGGAGACCAGACTCAAAATCTCTTCTTTCAGCTGGTG 
TTGNGGGAACTTGCATGCGGAAAAGAGGCAATNAAATTTTTCAAAGAAGACCGTGNGACTTG 
GAAGTCACTTGCCTGGGNGCATAGACTGGCCTTCCTCATCATGCATTCAGGACAACTCTACT 

ATAT 

SEQIDN0731 

TAGGTTTTGTTTAGTGTTTTCTT^AGTTCTTGTTTT 
SEQIDN0732 

GAAGCNACTAGTTCAAAATATGTGCAGTGTTGATCATATTCTTTTGTTATGGCCAGTTTTTA 
CCATTTGTTGGACACGTTTGATGCTGT 

SEQIDN0733 

GCCCGTANNANGGGTTCCNACACCNNCNTANGGTCCTNTTTTCCTTCTGAATNGAGCCTGCG 
ATAAACTCCATANANAACTAGCAAAAAGAGCTCCATTTTTTCACTAAAAACAACCGTTCAAA 
CAGCTATGANAATCCCTCTATCTCCATCAAAACCGCAGCATCCATCATCCTCAATAAAGGGC 
TGCACAAACCTGCTACAATCAGCATAAAAACAGCCCTGAAACTAGCTTCTTTCGAGCTAAAA 

TCAACT 
SEQIDN0734 

TGTGCTAANGTAGCCCGNTCTTATCAATAAGTGCAAAGTTTGG 
SEQIDN0735 

TAATAAAGCCCCGGGANAAGNNAAGAAAAAAAAGAGAAAAAGAAACTAGGCCGGGTCAAGGC 
AGGCCATATTGNNAGCACTACTGCCTGG 

SEQIDN0736 
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TGNGACCTTTTGAATCTCCCGAGTCTGNAGGTCTAGTTTACTCCCAATAGACGAGTATCACT 
ACAAGTCTACTGCAAATGGTTGATGTTTGATGTGGGAGACGAAACGATAAGCAATTTAGTAA 
CATGTGTCCTTTTTCACGTATATATAGATAGAGCAAGAATGAAAATGGAGACACCTTTTCCA 
TTTTTGAAGGATATATTGCTGTTTCTTCCCTCAAAGAGAGTTTTGTGCACTATGTTTGGTAG 
CTTTTCGAGAGTAGTATGTTTTTATCTCGTTGAAGCAACCTCCTTTTTTCCCCCTTGACTAG 

TTGACTTGAAGGG 

TGTTTGTTTATGACCCTGCTGGGTCATTGGTAATTATGTGTTTAGTACTATGTCTTGGTGC 
SEOIDN07 38 

CTTTCGTCGGAGCTTTNGCCGCCGCCGGCTACCATCAGTACAATCCTCCGNGCTGGGCGCCC 

TG^TCACCTTTTAAAACCACGGGNAAAAGTAAAAGTAAAAAAAAGNAGGAAAAAG 
AGGCCGGGTCAAGGGCAGGCCATATTGACAGCACTACTGCCTCG 

TGCGAGACATTGCAACTAAGCAAGCTCTTTCCCTACATTGNCGTATCCCAGCACACAGATAT 
CACGGGGCATGGAGCCATCCNNCAGTGTCAACCAGTGCGCTATATAGGCGGNGACATGCGGC 

GCG 

SEQIDN0741 

GNNNGNGGAGNAAGCAAGCATAGAAGGAGCAAANTGTTCATTCACTGTGAGTANGAAGACAA 
AG CAAGAAATAAT T C AG AAG CT GAT T GAAATAGTAAAT GAAAT AT CAAGCA 

SEQIDN07 42 

CCCTGCCTGGGAAATGGTCAATTTGAGGAAGGGCATTGGCAGCTAACTTGTTATATGCGCAA 
AGTCTTGTATGACATAGAAGTAGATGGCAACAGACAACAAGTTCCACCAGATGATTCCAAGG 
TTCAAACTCAAGAATCAAGGTGTTTTTATGAAGGAAATCAACATAATGACAATGATGGCTAC 

TGGGACTATAACTTCTTATTTGGAGGTGCAGGTGGAGGAGAACATAG 
SEQIDN0743 

CCTCAGTCTGAAAATTCCAACACCAATATGCCCCAATTTGATTCTAGCTTGACCCGTAACAA 
TATTGGATCAACCCCATATTATGGAAGTCATGAAAACATGACATCAACTAATTACCATATGG 
NGANTTATCATAATATGGTGCTTCCCAAGGAAAATATGTCAAATTTTGAAGAGGGTTCTTGT 
TCAATAGATTCTTATGACATGCAAACAGATCATCACAACAGTCGATGGACATTTCAAGATGA 
TGGAGATGACCTTCAGTCAGTGGCTTTCAGATATCTTCAACATTCTTGATCAGTANNTAGGN 

CTTCAAAAACAAATCATGGGTGAAGA 
SEQIDN07 4 4 

TNCTNTCATGGTGNGCCTACATTCNGGACACNGTANTGATCCTNGCCAGCANGATTGTCTTA 
CGCTACTACANTTGGANCGATNNGCCTTACCTGNCGGTTTTANTNNGAGGACAATAAGNTCG 
ACCNTCCNATCTGCCTGAGCATTNNNNCTATGATGANCGATNGGGAGGNCATTGTGCCATCT 
GCGAGTTGAANGATTATCCACAGTGAGAGCCGGAAACCCCTGCAATNCNANANTCTGGGT 

SEQIDN0745 

CCTTNNGTGGCTNGNGNTGTGCTCTGCGT 
SEQIDN07 4 6 
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GNTTGTCCTACCTGGTGATGGATAGTTTGGGTTCTGAATAATTTGTGGGATGCAACAACAAG 
CTTTTGGTTACTTTTTGTCAAGTACAGTGGTTACTTGAACTAGTTGTGTAATATATGCTATG 

GTAGTGGTCGTATCTCGAAACACGTGATATTTAGTGC 
SEQIDN07 4 7 

NCTTTGAATTTGAACCACTACCTAATATGAAAGAATGCCTGCTCGTAATGAAATACTTGTCA 
TGGTGTCTCTACCGAGTCCTTTGGCTAGGGCAACTCAATCAATATGCAGTCGTAAGAATGTT 
TTGAAATGCATATGTAGTCATCATCGGTGTTTTCACATTTATGTGAATTTGGATGTTCG 

SEQIDN07 4 8 

CCTGCTTGAGGTCCATTCTTTTTTCTCCTTTNTTTTAGTTCGAT7kACACTATATGCGGGTCT 
CTGATGGTTGTCGCGTNTTTTTGGGTGC 

SEQIDN074 9 

TTTGGAATACAATTCAACTTCTGTTTCCTAAAGAAATAGAAGCAAGAAAAGCAGCTGGAGCT 
TTGAATAGTAGAGAAGCTCGACGCAAAAGTCCAGTAAGAGCTGCTACAGCTCATTCTAACAT 

CTCTAGCAGCAGAATATCAAGAGTGTTCGCGC 
SEQIDNO750 

CAGTATCCCCCTTACTTGTGTCAAATCANCTTNTCCCAGTATGGCTTCCATATTTTGACTAC 
AATTCTTATCAGAAGGCATGATAGTAATAAGTGACAAAGATGCAAAAAACATAAAAGTTGTC 
CTTCACTTTTGGTTAGAGGCTGAAGATGAACTTTCTAAGTTGGACA 

SEQIDN07 51 

TTCGATCGGTGAAGCTTCTTTACCAAC 
SEQIDN0752 

TACAAAGNAATGCNGTNCCAAAATACATTGAAATAATTGGCAGCCGAATACTAAACTTGATC 
ATGT 

SEQIDN0753 

CCCGAATTTCGTCCGCCAAATTGTCGTGCATAGGAACAGAACGAGAGCCATCAATGCCGTAG 
GCGCCTTTCGCGTACCACATGACCCGAGAAAAAACACCGGAAAGGATTTCCGTGATTTGTTT 

CTCGGTGTAGCTACGCCGCA 
SEQIDN07 54 

TTGGTNTTGNCACCTGCNAATGGCNNTACATGGAGCAGGGACGNNAATAAGTGGNACGAGTG 
ACCACATGAGGGAG 

SEQIDN07 55 

CATCTCNTCCTCACTTCTTGAACTGTACGCCCACCCTTTTTCTTCTTGGNTNTGTTCTTANA 
AGTTTCTGGCACCTGCTTTTTGCTTCTATTATCATCAGCTTCTTCAGGA 

SEQIDN0756 

NACACCAATATGCCCCAATTTGATTCTAGCTTGACNTGTAACAATATTGGATCAACCCCATA 
TTATGGAAGTCATGAAAACATGACATCAACTAATTACCATATGGAGANTTATCATAATATGG 
TGCTTCCCAAGGAAAATATGTCAAATTTTGAAGAGGGTTCTTGTTCAATNGATTCTTATGAC 
ATGCAAACAGATCATCACAACAGTCGATGGACATTTCNAGATGATGGAGATGACCTTNAGTC 
AGTGNNTTTCAGATATCTTCAACNTTCTTGATCNGTNNCTATGNCTTNAAAAACATATCATG 

GNTGATGA 
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SEQIDN0757 

CTTTGGGGCCGTTCTTGGNATCCGTCGAACTAGGGTGTTGAAATTTCTNTTTTTTCTTCTTT 
ATTGGGTTCTATTATCGATTNCATGNGATATTTTATTTCCTTATTTGTGTTTGAGTAATNGT • 
TTTCCATGTTTGCTTGTTCGATTTCTACCACTATATAACCCCTCCCCAATTACCCTTTTGGA 

CAGACC 
SEQIDN07 58 

GGTANCTCTNGGNCTGCGAANANGNCTCTNAGCCTTNCNCAAGCGNGCGCGAGAGAAGCGGC 
NNACNNAGCTACCGNTTCACCCGNCCGACTAAAANACAACAGNCGCAGACCTACTTTGATTC 
ANAAGAAAGGNGACGGNTTCGCNAACANGNANNCGGNTTTCTATCANAGGTGCNAGGGTTCC 

AAACC 

SEQIDN07 59 

CCTNTGGNGTTCTGNNAATTCTTGTACACANAAGGGCAAAACAAACAAAGGAAGAGCAGCAA 
AGTATGAGTAGAGCTTCAGTAGTACTAGTAGCTATTATGGTNGTGGAA 

SEQIDNO7 60 

GNGGCATTCGGANCGATGGATTGGTCTTCATAACATTCATCATCTTTACATTGCAGCATTTC 
AGAAG 

SEQIDN07 61 

TCAAAANTANTNNCNTNCTNGNNCTGCACATTGAGCATGTGCTCANCAACCTNTNTTGTGCT 
CNNTNTTCCCCTGAACATAGNAGTATGCAG 

SEQIDN07 62 

• TAGNNCCTGAGACNNAGNAAGAAGACAGACNGTCACTGCAACGCCNNANGNGAGCATGACNN 
GANCNGNGGNAC 

SEQIDN07 63 

GGCACAAGTNNAANNGCCTGTNTCGAAGGTGNGGCAACAACC 
SEQIDN07 64 

CAACGTAAAGGATTCAATTCTTGTTTTGTTTGTTCATCATTGAAATAATTTTTTTTTAGTCT 
TGCATTATATGTTTGGTTGGT 

SEQIDN07 65 

GGCTTGGNGGNNGCGGGTGNCCACCATGNNATGCATACANTATNCATGTANGNNGCTACANA 
GACACATTNGGAATAATGNGTCGGATCGNTTAGNNNTGGG 

SEQIDN07 66 

CNCGATTNNATACAACCCTGAGAAAAGAATGTTAAAAAATGACTATCTTTTGTAAAGAAACC 
CCTTTCATTTCCAGGCAATGCAAGGGGGATCACAGTTTTACATNGTGGGTGTGGTTATTTTA 

CGTCACAGTT 
SEQIDN07 67 

ACGATCGATNANGTGGNCTNGNAACATTCANCATACTTTACATNGANATNTCANAGGTTACN 
CAGGNCTCATCANTGGNNNAGCCTNTGCTCANCG 

SEQIDN07 68 
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TCGCACACAGTATCATGAGAATNNTGGNCTTGTCATCCTCAAAAGAATCCTGTNANAGCATG 
G 

SEQIDN07 69 

NTCTACAATNGCATACANCATCAAGCATAGNCAATCACAAACATGTCATGTANAAGTCCTGA 
AATTTCGATGTCAGGACTAAGCTATAAGNACTACTACATGGAAAGCATATATGTGCATTCGT 

NGTCCAAGCAT 
SEQIDNO770 

GAGCCTGCTGGATCTTCTTTCTCTTAGCAAAGAGGAAAGGAAGAAACTAGTCGAAGAGCGCC 
CTGGAATCAATAATTCTACTATTACTGCTCTCATTTCTCTAAAATGGAAGGAATTGAGTGAA 
GAAGAAAAACAAGTGTGGAACAACAAAGCAGCTGAAGCATACAAAAAGGAAATGGAAGAGTA 
CAACAAATCTGTAGCAGAAAAGCAGAACAACAATTAGAAATAGTAGAAATAACTATAATATG 
TTC7\ACTGATTATGTTGAACATAGAATGATTGCTAGTTAGTTGAAGTAGTAAATAGGTATCA 

TTCCAATTTCCTTTGTTGTTTAGTAGCAG 



SEQIDN0771 

TCCGNTGCAANCGGNNCTTNCACNCTTAGCAANAACACNNTNCTGGGGATTNNAGTCATGCC 
ACAANTAGCAGGGGCTNAGNCGNCC 

SEQIDN0772 

GGTTCTNCTNTNNCTGCTGCGCCTGACAGCANTTGTGTGGNTCTGNCGCTGCACNCNNCNGC 
TGTNTACGCNGGAGGNGNAAANGGNTGNNCCTGNTNNGGAGTCACATGATGACANGNGTNAN 

ANNTNGTTNNA 
SEQIDN0773 

ANGNGCTATATCTTCGNNAGAAANACTGCTGCGCAGTGTGNAANAGCGTGNNTTCACGGTAT 
GNANGGNNGATNNNACTNTGCAGNAACTNCNA 

SEQIDN07 7 4 

CTGTTGNTCTTTGGNCACATGATGATTCAGNTTGNNAAATNTGTGG 
SEQIDN07 75 

ATAGTAlACGTGCCTCTTTGTTTCTGCNNTCAATTNGGCTANAGTCNAGTGGAGTAACGCGTG 
NGCCATTNTTNTNGAAGCTGTCGG 

SEQIDN07 7 6 

NTTTATGCCGGAANAAAGNNAGGCNAGNATGCAGATGCNGGNNACATAACGCTAATATGNGG 
ATGAATNAGGACNAGCAGCAGTGAAACTCCTTCCC 

SEQIDN0777 

NGAGTNAAGGGCCANTCTGAATNTGGCCTAATNTGGNT7\AANNGNGGGGAGTANGCCGNACA 
NANTNATTCTTGTGGNTGGNNNNNCGTTNA 

SEQIDN0778 

CTGATATGGGGATTNNGAGGCAAGGGGTATGGGGNATCATGAAGNTGGTTGCAG 
SEQIDN07 7 9 

GANNAGGCGCTCCCTCCTTNCTTTGTGATGACANCNATNGAANGAGAAGACTCCTA 
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SEQIDNO780 

GAAGCATAGCCCNGCGCNGNTNGCGTNAATGAGANCACAGATGGNNCTAAAANATGANTGNT 



SEQIDN0781 

CCGCCTANTGCCTGTTAAGTCTAGCAACCTCCTCNAGAGTTNGGGAATTCACAATGGCAGCC 
SEQIDN0782 

GTANGGCCGAGTNAANGGTAGCAGAACTTNGAATGTGGGACNNGAGNGTACAANGCGTCNGA 
CANNGACTTNGTGTANANNC 

SEQIDN07 83 

GGNAGCGCTAGATGANCAAGACACAATTGATATGCAGTCTTAGGAANCTAGAGAGAGANTGT 
AGANTANGGTGATGAACGCACNTNGG 

■ 

SEQIDN078 4 

TATTTNCCTGCGTGACCTAGTAAANATNGATAGGCCTCNANAGGTGGGGTTANTNAGGNCTC 
ATCAATNCCNAGACCCAAATCAGGCAATC 

SEQIDN07 85 

AAGCNGANNGACCTGTNTTGCACCTNAATATCCNNAGCCAAGGAAGANNGACGNTGGCTGGA 
TGANNNCAATNCTTNNANNAACCANNTACTGNCCN 

PRIMERS 

SEQIDN0786 
CTCGTAGACTGCGTAGT 

SEQIDN07 87 
GATCACTACGCAGTCTAC 

SEQIDN07 88 
GACGATGAGTCCTGAG 

SEQIDN07 89 
TACTCAGGACTCAT 

SEQIDNO7 90 
GACTGCGTAGTGATCNNN 

SEQIDN07 91 
GATGAGTCCTGAGTAANN 
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(54) Title: IDENTIFICATION AND VALIDATION OF NOVEL TARGETS FOR AGROCHEM1CALS 

(57) Abstract: The invention relates to a method for identifying and validating plant targets for agrochemicals, comprising the steps 
of determining gene or protein expression profiles in function of the progression of an essential biological process in a plant, and the 
^ subsequent downregulation of expression of said gene or protein in a plant cell. More particularly, the effects of downregulation of the 
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^ The invention also relates to plants tolerant to agrochemicals such as herbicides or pesticides. 
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